PCX 



WORLD IN l LLLhC'l UAL PROrhRTV OKGANI/A'nON 
IiitL-niaiioEUil Muicai) 




INHiRNATIONAL APPMCA riON PIJBLISHI:D [INni:R TlUl PATIiNT CT)OPP:RAnON rREATY (PCT) 



(51) International Patent Classificiition ^ : 




(11) International Publication Number: 


wo 98/16645 


C12N 15/31, C07K 14/35, 16/12, C120 
1/68, C12N 15/62, COIN 33/53 


A2 


(43) International Publication Date: 


23 April 1998 (23.04.98) 



(21) International Application Number: PCT/US97/18214 

(22) International Filing Date: 7 October 1997 (07. i 0.97) 



(30) Priority Data: 

08/729,622 
08/81R.i 11 



il Ortobe.'' 1996 (11.10.96) 
13 March 1997 (13.03.97) 



US 
US 



(71) Applicant: CORIXA CORPORATION fUS/US]; ] 124 Colum- 

bia Sta^^t. Seattle, WA 98104 (U'S). 

(72) Inventors: RI^HD. Steven. G.. 2843 - 122nJ Place U.E.. 

BellevLie, WA 98(X)5 (US), SKEIKY, Yasir, A., W.; 8327 
- 25th Avenue N.W., Seattle. WA 98107 (US). DILLON, 
D;iviii. C; 21607 N.E. 24th Street, Rcxlmond, WA 98053 
(US). CAMl'OS-NErrO, Atuoaio; 9308 Midship Coun 
N.H., Hainbridgc Island, WA ^)802l (US). HOUGHTON. 
Raymond; 2636 M2nd Phcc S.E., Bothell. WA 98021 
(US). VEDVICK. Thomas, S . 124 South 300th Place, 
Federal Way, WA 98003 (US) TWARDZIK. Daniel, R.; 
10195 South Beach Drive, Bambndge Island. WA 98110 
(US). LODES, Michael, J.; 9223 - 36th Avenue S.W., 
Settle, WA 9812fi (US). 



(74) Agents: MAKI, David, J. ct al.. Seed and Bcriy LLP, 
6300 Columbia Center, 701 Fifth Avenue, Seattle, WA 
98104-7092 (US). 



(81) Designated States: AL, AM, AT, AU, BA, BB, BG, BR, BY, 

CA, CM, CM, CU, CZ, DH, DK, EE, ES, Fl, GB, GE, GIf, 
ilU, ID, IL, IS, JP. KE, KG, KP, KR, KZ, LC. LK, LR, Ls] 
LT, LU. LV, MD, MO. MK. MN. .MW, MX, NO, NZ, PL, 
PT, RO, RU, SD, SE. SG. SI, SK, SL, TJ, TM, TR, TV, 
UA, UG, UZ, VN, YU. ARIPO patent (GH, KE,'lS, kfSV,' 
SD. SZ. UG. ZW). Euia.sian patent (AM, AZ, BY. KG, KZ, 
MD, RU, TJ, TM). [European patent (AT. BE. CH, DE, DK, 
ES. FI, FR, GB, GR. IE. IT, LU. MC:, NL, PT, SE), O^UI 
patent (BE, BJ, CF, CG. CI. CM, GA. ON, ML. MR. NE, 
SN. TD, TG). 



Published 

Withoul ifUernalional search report ant 
.:pon receipt of that report. 



d to he 1 epublis^%d 



(54) Title: COMPOUNDS AND METHODS I-OR DIACJNOSIS OI- rU'BKRCHILOSIS 



L! 

1.6^ 

^4- 

1.2- 

\ 

0.8 

0.6- 

O.i 

0.2 

0 - 



38kd 

'///. 



POSiTIVCS 
n=5 



i 



58kd NEGATIVES 



PPD+ 
n=tO 



NORMALS 
n=18 



\I 








I s 








SI 




\ M 




II 




1,1 








SK 




A I 




IK 




1 I 








,s\ 




\l 




(;a 




1 \ 


1 . ir, u 






S/ 


Sa s/i l.inJ 


A/ 


A :bii[ in 


<;b 




\1( 








II) 


[ ti.id 


IIA 




<;i: 




\tl) 


Rr;iiihlir 


)i ^^ 




K. 




hit 




(,M 




M(. 


M.iLl,i;',2St 






rj 




1(1 




(;n 


( lUlllr.M 


MK 




Yti 


;.Kl.r. 


1 \i 


lutkr.i.n.s 


Mi 




(,K 












I \i 


1 .[ir^ 


lu, 




III 




\^] 








1 1 


lMn),l,i.| .1 


It,) 




[) 




M\ 








1 \ 




MR 




n 




MK* 












11^ 





















wo 98/16645 



PCTyUS97/18214 



1 

COMPOUNDS AND ML THODS FOR DIACiNOSlS OF TUBERCULOSIS 

THCIINICAL FIELD 

^Fhc present invention relates generally to the detection of Mycohactcrium 
tuberculosis infection. The invention is more particularly related to polypeptides comprising.* 
a Klycohactenum tuberculosis antigen, or a portion or other variant thereof and the use of 
such poIypcpLidc^ for the serodiagnosis o( Mycobacterium tuberculosis infection. 

BACKCJROIIND Of I MF: IN VF;N110N - . 

Tuberculosis is a chronic, infectious disease, that is generally caused by 
infection with Mycobacterium tuberculosis. It is a major disease in developing countries, as 
well as an increasing problem in developed areas of the w^orld, with about 8 million new 
cases and 3 million deaths each year. Although the infection may be asymptomatic for a 
considerable period of time, the disease is most commonly manifested as ah acute 
inflammation of the lungs, resulting in fever and a nonproductive cough. If left untreated, 
serious complications and death typically result. 

Although tuberculosis can generally be controlled usmg extended antibiotic 
therapy, such Treatment is not suiTicient to prevent the spread of the disease. Infected 
indi\iduals ma\ he as\'mp!omatiL\ hiil contagious, for some tmic hi additu>n. although 
comphancc with tlic tivatmcnt rvginicu i.^ critical. [laliciU l^cha\u>r is dilficult lo rnonildr 
Su;iK' paticnls do nni cooij^lctc the couisc ol in'.'ilnicnl. winch can Il\k1 Ut ifU'lfccti\c 
trcUiTK'iit and the di'vcldpniciit of dius' rcNistaiicc 

Inhibiting ihc spread of Uihcrcuh)s(s udl require clfectue vaccinatKni and 
accurate, car]\ diagnosis of the disease C urrcndw \ accination with h\c bacteria is the most 
ctticicnt method (or inducing proiccti\c !ninuinil\ Ihc most coninitMi M\ cobaclenum iov 
ihi^ piirfinsc I . liaL-ilii;-^ ( 'almvMi- ( -m.- ■ i^a a : 
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site by 48-72 hours after injection, which indicates exposure to Mycobacterial antigens. 
Sensitivity and specificity have, however, been a problem with this test, and individuals 
vaccinated w^ith RCCi cannot be distinguished from infected individuals. 

While macrophages have been shown to act as the principal effectors of 
M. tuberculosis immunity, T cells are the predominant inducers of such immunity, l^he 
essential role of T cells in protection against A/, tuhcrcuiosis infection is iliustraLcd bv the 
frequent occurrence of A/ tuherculosis in AIDS patients, due to the depletion of CD4 T cells 
associated with human immunodeficiency \'irus (MIV) infection. Mycobacterium-reactivc 
CD4 T cells have been shown to be potent producers of gamma-interferon (IFN-y), which, in 
turn, has been shown to trigger the anti-mycobacterial effects of macrophages in mice. Whik 
the role of If N-y in humans is less clear, studies have shown thai 1 ,25-dihydroxy-vitamin D3, 
either alone or in combination with If\-y or tumor necrosis factor-alpha, activates human 
macrophages to inhibit M. tuberculosis itifection. l-urthermore, it is known that IPN-y 
stimulates human macrophages to mcike 1 ,25-dihydroxy-vitamin D3. Similarly, IL-12 ha.s 
been showai to play a role in stimulating resistance to A/ (ubcrculosis infection. Vox a review 
of the immunology ol' M. tuberculosis infection sec C'han and Kaufmann, in Tuberculosis: 
Piithos^enesis, Prolectum and (^ontro/. Bloom (ed.), ASM Press, Wasliington. DC\ 1994. 

Accordingly, there is a need in the art for impr-oved diagnostic methods foi* 
detecling tuberculosis, [ he present iin LMilioii tuirills this need aiul tiirther pro\ ides otluT 
['elated ad\'araaue>. 

SI AlMAin- ( H III! I\\ 1 NflON 

Briell) slated, tlu- piesenl iineiilion proNules composuions and metlioils foi 
diagnosing tuberculosis. In one aspect. pol\ [u-plides aie [novideJ coinpiising an aiiiiL'cnie 
portion ti) a sohiMe M /;//nvc///r»ws jntieen. j \aiiani sueii an antiuen that dilfers nu\\ 
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(b) Ala-Val-Glu-Scr-Gly-Mct-Leu-Ala-Leu-Cily-Jhr-l>ro-Ala-Pro-Ser 
(SBQ ID NO: 116); 

(c) Ala-Ala-Mcl-Lys-Pro-Arg-Thr-GIy-Asp-GIy-Pro-Leu-Glu-Ala-AIa- 
Lys-Glu-GIy-Arg (SVQ ID NO: 117); 

5 (d) lyr-lyr-Trp-Cys4^ro-Gly-Gln-Pro-PliL-Asp-Pio-Ala-Trp-GI)-Pro 

(SHQ ID NO: 118); 

•(e) Asp-Ile-(}Iy-Scr-Glu-Scr-rhr-(iIu-Asp-Gln-Gln-Xaa-Ala-Val (SF:Q ID 
NO: 119); 

(0 Ala-(}]u-(}Iu-Scr-Ilt-Ser-Thr-Xaa-C;iu-Xaa-Ilc-Val-Pro (SHQ ID 
10 NO: 120); 

(g) Asp-Pro-Cilii-i'ro-Ala-Pr()-Pi-oA^aNPm-rhr-Thr-Ala-Ala-Scr-Pro-I^ 
Scr{SP;Q ID NO: 121): 

(h) Ala-Pro4.ys-Thr-TYr-Xaa-Cilu-C]lu-Lcu-Lys-C}ly-rhr-Asp-7'hr-Gly 
(SFQ ID NO: 122); 

15 (i) Asp-Pro-Ala-Scr-Ala-I^t>-AspA^a]-Pm-Thr-Ala-Ala-(iln-Lcu-'rhr-Ser- 

Lcu-LcLi-Asn-Scr-Lcu-Ala-Asp-Pm-Asn-Val-Scr-Phc-Ala-Asn (SHQ 
ID NO: 123): 

(j) Xaa-Asp-Scr-Cilu-Lys-Sci-Ala-Thr-Ilc-Lys-Val-Ihr-Asp-Ala-Scr: 
(Sl'O ID NO; 12^)1 

2(1 (k i Ala (11 V Asp Thr Xaa lie I i:.'A'al( .I\ - \sfi-l lhi^ i hr- Ala- Asp: 

(Si-:(J ID N( ). 1 ^.()| oi 
(1) Ala-Pn>-Glii-Scr-(rl\-Ala-(.l>-LrLi (r[\ (ih lhr-Wil ( ilii-Ala-Cily; 
(Sl:0 ID NO: 131) 



wliCK'ii! \aa (iia\ lu- am ammd acid 
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(m) Xaa- ryr-JIc-Ala- ryr-Xaa-Thr-Thr-AIa Gly Ile-Val-Pro-GIy-Lys-Ilc- 
Asn-Val-JIis-i.cu-Val; (SHQ ID NO; 132) or 
(n) Asp-Pro-Pro-Asp-FVo-Uis-(jIn-Xa£i-Asp-Mct-r}u-Lys-Gly-7yr-Tyr- 
Pro-Gly-Gly-Arg-Arg-Xaa-Phc; (SEQ ID NO: 124) 
5 wherein Xaa may be any amino acid. 

In another embodiment, Ihc s'.>luble A/ tuberculosis antippFi comprises an 
amino acid sequence encoded by a DNA sequence selected from tlie group consisting of the 
sequences recited in Sb;Q ID NOS: K 2, 4-10, 13-25, 52, 94 and 96, the compleinents of said 
sequences, and DNA sequences that hybridize to a sequence recited in SHQ ID NOS; 1, 2, 
10 4-10, 13-25, 52, 94 and 96 or a complement thereof under moderately stringent conditions. ' 

In a related aspect, the polypeptides comprise an antigenic portion of a 
A/, tuberculosis antigen, or a variant of such an antigen that differs only in conservative 
substitutions and/or modifications, wherein the antigen comprises an amino acid sequence 
encoded hy a DNA sequence selected from the group consisting of the sequences recited in 
15 Sb:Q ID NOS: 26-51, 133, 134, 158-178 and 196, the complements of said sequences, and 
DNA sequences tliat hybridi/e to a setjuence recited in SP:Q ID NOS; 26-51, 133, 134, 158- 
1 78 and 196 or a complement thereol under moderately sti ingent ct)nditions. 

In related aspects, DNA sequences encoding the above [)ol> peptides, 
recombinant expressi(>n \ ectors c(Hn]irising these DNA sequences and h(isl cells trans] ormed 
20 o\ iransfccled \\\\\\ such c\[^i"ession \ ectors are also prox'ided. 

In aiioUicf :ispccl, ilie piescnl ]n\enlii>n proMde.s iusion protems ctiniprismi: a 
tii^l aiul a secv>iKi iinentixe poIyjK'plKle or, alternali vel\ . an in\enti\e f>ol\fieptide and a 
known M (uhcrculosis antigen. 

hi tuillier asj^ects ol [he subject nuention, methods and diagnostic kiis atv 
iMovhlrd loi liftrLtir!!' Inbcicnlosis m a j^alient. [ he rnclliods comprise: (a) contacting a 

1^,. ,! , : ' ■ 



! !IC il.lL'llo.'!. [, 
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The present invention also provides methods for detecting M. tuberculosis 
infection comprising: (a) obtaining a biological sample from a patient; (b) contacting the 
sample with at least one oligonucleotide primer in a polymerase chain reaction, the 
oligonucleotide primer being specific for a DNA sequence encoding the above polypeptides; 
5 and (c) detecting in the sample a DNA sequence that amplifies in the presence of the first and 
second oligonucleotide primers, in one embodiment, \hc oiigomiclcoiiiic primer comprises at 
least about 10 contiguous nucleotides of such a DNA sequence. 

in a further aspect, the present invention provides a method for detecting 
M. tuberculosis infection in a patient comprising: (a) obtaining a biological sample from the 
10 patient; (b) contacting the sample with an oligonucleotide probe specific for a DNA sequence 
encoding the above polypeptides; and (c) detecting in the sample a DNA sequence that 
hybridi/cs to the oligonucleotide probe, hi one embodiment, the oligonucleotide probe 
comprises at least about 15 contiguous nucleotides of such a DNA sequence. 

In yet another aspect, the present invention provides antibodies, both 
1 5 polycKnial and monoclonal, that bind to the polypeptides described above, as well as methods 
for their use in the detection of y\/ (uhcrculosis infection. 

These and other aspects of the preseiit invention will i^ecimic ai:)parent upon 
reference to the Ibl lowing detailed deseription and attachetl drawings All refcreiiees 
disclosed herein arc herein- incniptiratcd in reference ni then enlirel\ it' e:ich \Kas 
20 Hicorperated nulivuiuallv. 

URIl 1 1)1 SCRUM ION Ol 1 1 If. DRAWINGS AND SI-:QI 'bNC d: ll)i:\riMb:RS 

f igure l.A and H illustrate the slunulation ot [proliferation and inlcrleron y 
produetion in T denxcd iroin a fu'st and a second A/ ////n7't7//f>.s7\-i!iimune vlontM. 

!es[>een\'ei\ . h\ the 14 Kd, ^f) Ktl and Kd aiitii'ens described m fxaniple 1 
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Figure 3A illustrates the stimulation of proliferation in a TbH-9-spccific T cell 
clone by secretory M. tuberculosis proteins, rccombin^mt 'rbF^-9 and a control antigen, 
TbRal L 

Figure 3B illustrates the stimulation of intcrfcron-y production in a TbM-9- 
3 specific T cell clone by secretory M. tuberculosis protenis, PPD £ind recombinant 'rbH-9. 

Figure 4 illustrates the reactivity of two rpnrcsentativc polypeptides with sera 
from A/ tubcrculosis-mizcXQ^d and uninfected individuals, as compared to the reactivity of 
bacterial lysate. 

Figure 5 shows the reactivity of four representative polypeptides with sera 
10 from M. tubcrculosis-iniccicd and uninfeeted individuals, as compared to l^e reactivity of ific^ 
38 kl) antigen. 

Figure 6 show^s the reactivity of recombinant 38 kl) and TbRal 1 antigens with 
sera from A/, tuberculosis patients, FPI.) positive donors and normal donors. 

Figure 7 show^s the reactivity of the antigen TbRa2A with 38 kl) negative sera. 
15 Figure 8 show'S the reactivity of the antigen of SliQ H) NO: 60 with sera from 

A/, tuberculosis patients and normal donors. 

Figure 0 illustrates the reactivity of the recombinant antigen rbII-29 (Sh'Q ID 
NO: P>7) with sera trom M (ubercnlosis patients, PPI) positive doiKM's ami normal donors as 
dLMrrniiriL-d h\ iiKliiet t \'\ ISA 
2*' figiiic in illuslKitL'N the iLMCliMlv ot the irLO[ni>iii;iiit ,inlii>e[) rbM-"^"> (SI-(,) 

11) \( »: 14<M -jia iMun A/ luiu'rculosis [xiliciU.s and \um iioinial doruns, anil with ;t poi^] 
o\ Ncia Irom M. luhcrcu/oMs patients, as determined both b\' direct and indirect I-FISA 

Figuie 1 1 ihustrates the reaeti\ity of increasing concentrations ol the 
lecombinant antigen IF!FV> (Sl (^) ID NO: l lO) wnh sera triMu M. lulx-uailosis i\Uicnh- anJ 
2^ troiii iioMiial donors ,is dctcnniiicd In 1,MSA. 



N. I I . i I 1 ■ . . . ■ " o 
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SEQ. ID NO. 5 is the DNA sequence of TbRaLl. 
SEQ. ID NO. 6 is the DNA sequence orTbRal6. 
SEQ. ID NO. 7 is the DNA sequence ofTbRaH. 
SEQ. ID NO. 8 is the DNA sequence of! bRal8. 
SEQ. ID NO. 9 is the DNA sequence of rbRal9. 
SEQ. ID NO. 10 is the DNA stn|Mcncc of ThRa24. 
SEQ. ID NO, 1 1 is the DNA sequence of 'rbRa26. 
SEQ. ID NO. 12 is the DNA sequence of TbRa28. 
SE:Q. id NO. 13 is the DNA sequence of TbRa29. 
SI'XJ. ID NO. 14 is the DNA sequence of TbRii2A. 
SEQ. ID N(). 15 is the DNA sequence of 1 bRa3. 
SI ;Q. id no. 16 is the DNA sequence of ^fbRa32. 
SEQ. ID NO. 17 is the DNA sequence of TbRa35. 
SEQ. ID NO. 18 IS the DNA sequence of TbRa36. 
SEQ. ID N( ). 19 IS the DNA sequence of TbRa4. 
SEQ. ID NO. 20 IS the DNA sequence of TbRaO. 
S1:Q. ID NO. 21 is the DNA sequence of TbRaB. 
SI-Q ID \0. 22 is the DNA setjuence of TbRaC\ 
SI (,). ID \(). : M> the DNA sequence ol' ! hR;iI ) 
Sl:(,), ID \r) 24 1 ; the DNA sequence ol'WW'CVi - 
S\:[j. ID \( ). 2.-^ i> tfic l.»NA sequence AAMK. 
Si ;(,), ID \(). 20 1 ; the I )NA sequence of rhI .-23. 
SI':(J. ID NO 2'7 IS the l )\A sequence of TbE-24. 
Sf (,), ID NO. 2S i;. the DNA >equence of Tbl -2^ 
SE(2. ID N( ) 2') !> the DNA sequence ol fhl 2S. 
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SBQ. ID NO. 35 is the DNA sequence ol I bM-B. 

SI:Q. id no. 36 is the DNA sequence of rbM-6. 

SEQ. ID NO. 37 is the DNA sequence of TbM-7. 

SI:0. ID NO. 38 is the DNA sequence of TbM-9. 
5 SRQ. ID NO. 39 is the DNA sequence of TbM-12. 

SIiQ. ID NO. 40 is the DNA sequence of TbM-l 

SliQ. ID NO. 41 is the DNA sequence of TbM-14. 

Sf;Q. ID NO. 42 is the DNA sequence of l bM-15. 

Sf;Q. ID NO. 43 is the DNA .sequence of TbH-4. 
10 SFO- ID NO. 44 is the DNA sequence of TbII~4-FWD. 

Sl.O. ID NO. 45 is the DNA sequence of TbIM2. 

SLQ. ID NO. 46 is the DNA sequence of Tb38-1 . 

SI:0. ID NO. 47 is the DNA .sequence of Tb38-4. 

ShO. ID NO. 48 is the DNA .sequence of TbL-17. 
15 SliO. ID NO. 49 is the DNA .sequence of 'rhh-20. 

SlMJ. ID NO. 50 is the DNA sequence of TbL-21. 

Sl g. ID NO. 51 is the DNA sequence of ThI I- 1 6. 

Sfg. ID NO. 52 is the DNA sequence of DPl-P. 

S] i). ID NO. is Ihc deduced aniiud acul >OL]ue!ice ofDri F. 
20 SI (,). ID NO. \ IS the protein sequciiL\Mi!d )P\' N-tcrniina! Anliuen. 

Si <j, II) NO. is the prt^lcm scqiicrue ol A\'(iS N-Ierniinai .Antiuen. 

S\ <). 11) NO. 5o IS the protein sequcnee ol .\.\MK N lerininal .AnliL^en. 

SfO. ID NO. 57 IS the protein sequence ofA'VWO N-lerminal .Anti^^cn. 

Sf'O. ID NO. is the pr(itein sequence oT 1 )I( iS N-tcrtninai Antiijen 
.'"^ SI (,) il ) NO ^'O tlir prouan s^'quern r ol Al '1;S NMe[-mi[iaI Antiueii 
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ShQ. ID jNO. 65 is the deduced amino acid sequence of TbRalO. 
SEQ. ID NO. 66 is the deduced amino acid sequence of TbRal 1 . 
SEQ- IE) NO. 67 is the deduced amino acid sequence of TbRal 2. 
SEQ. ID NO. 68 is the deduced amino acid sequence of TbRal 3. 
SEQ ID NO. 69 is the deduced amino acid sequence of TbRal 6. 
SEX), in NO 70 is the deduced amino acid sequence ufT^liRai?. 
SEQ. ID NO. 71 is the deduced amino acid sequence of TbRal 8. 
SEQ. ID NO. 72 is the deduced amino acid sequence of TbRal 9. 
SEQ. ID NO. 73 is the deduced amino acid sequence of TbRa24. 
SI'Q. ID NO. 74 is the deduced amino acid sequence of TbRa26. 
SI (,). ID NO. 75 is the deduced amino acid sequence of TbRa2S. 
Sl'i ). ID NO. 76 is the deduced amino acid sequence of 'fbRa29. 
SI^Q. ID NO. 77 is the deduced amino acid sequence of rbRa2A. 
SI'Q. ID NO. 78 is the deduced amino acid sequence of TbRa3. 
ShY). ID NO. 79 is tlie deduced amino acid sequence of rbRa32. 
Sh"Q. ID N( ). 80 is the deduced amino acid sequence of ThRa35. 
SI:Q. ID N( ). 81 is the deduced amino acid sequence of 'fbRa36 
S1:Q. ID NO. 82 is the deduced amino acid sequence of rbRa4. 
SI'(j ID ^il ) X> r> the deduced amino aeid .sequence (^f Tblv'i^f 
Sl'(,>, ID Ni ). S4 1. tlie deduced ammo aeiJ scLjiience of TfM^iH 
SI (J. II) N( ). X."^ IS the ilcduced amino acid sequence of I bRal '. 
ShQ. 11) 86 IS the deduced amino acid sequence ot IbRaD. 
Sl'O. II) ). 87 IS the deduced ammo acid sequence of YV\V(1*Ci 
S1-(J. If) N< ). SS IS the deduced ammo aeui sequence of TbAAMK. 
Si (,) II ) ). 89 IS the deduced amitui acui scLjueiicc oi I'h ^S- 1 
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SEQ. ID NO. 95 is the deduced amino acid sequence ofDFAS. 

SHQ. ID NO. 96 is the DNA sequence of DP V. 

SFQ. ID NO. 97 is the deduced amino acid sequence of DPV. 

SBQ. ID NO. 98 is the DNA sequence of i:SAT-6. 
5 SMQ. ID NO. 90 is the deduced amino acid sequence of FSA1'-6. 

Sf:Q. II) NO. 1 00 is the DNA sequence ofl"bIi-8-2. 

SHQ. ID NO. 101 is the DNA sequence of TbH-QFL. 

SHQ. ID NO. 102 is the deduced amino acid sequence of rbM-9FL. 

SFQ. ID NO. 103 is the DNA sequence of TbH-9-l. 
10 SFQ. ID NO. 104 is the deduced amino acid sequence oi~^rbF F9-l . 

SFQ. ID NO. 105 is the DNA sequence orTbIF9-F 

SFQ. ID Nt ). 106 is the deduced ammo acid sequence of 1 b[F9-4. 

Sf.Q. ID No. 107 IS the DNA sequence oFl b38-lF2 IN. 

SFQ. ID NO. 108 IS the DNA sequence of I b38-1F2 Rl>. 
15 SI-Q. ID No. 109 is the deduced amino acid sequence of Tb37-FF. 

SfQ. ID N< ). 1 10 is the deduced amino acid .sequence of Tb3S-IN. 

Si:0. ID NO, II 1 is the DNA sequence of Ih38-Il'3. 

SFQ. ID Ni ). ] 12 IS the deduced ammo acid sequence of 11)38-1 1'3. 

SI:q. id \( ) 1 the DNA sajucncc of rb>X 1I\n. 
20 SfO. ID N'( ). 114 :s i\vj i)N \ sequence o! lb3X-lI'7v 

Si (J. II ) N(), 1 i ! , t he dcti!ic.\i 's terunna! amino acid sequence o\' I )F\'. 

SI (J. 1 1 ) N( 1. ] 1 o i,> the deduced \ iciinmai ammo acid sequence of .A V^C iS. 

S\ { ). ID Nt 1. 1 F' is the deduced N-tenninal annno acid sequence of A.AMK. 
II) NO. I IS is the deduced \-terimnal ammo acid sequence o\ \Y\\{ 

S|-(J ll)\() ' the deduced \'-ieiiniiMl amitid acid sequence (^t DKiS, 
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SEQ ID NO. 125-128 are the protein sequences of tour DPPD cyanogen bromide 
fragments. 

SEQ ID NO. 129 is the N-tcnninal protein sequence of XDS antigen. 
SEQ ID NO. 130 is the N-terminal protein sequence of ACiD antigen. 
SI-Q ID NO. 131 is the N-terminai protein sequence of APE antigen. 
SEQ ID N(3. 132 is the N-tcrminal protein sequence of XY! antigCii. 
SI-Q ID NO. 133 is the DNA sequence of Tbn-29. 
SEQ ID NO. 1 34 is the DNA sequence of 'rbH-30. 
SEQ ID NO. 135 is the DNA sequence of 'I bH-32. 

SEQ ID NC). 136 is the DNA sequence of TbH-33. • 

SEQ ID NC). 137 is the predicted amino acid sequence of TbM-29. 

SEQ ID NO, 138 is the predicted amino acid sequence of 1 bH-30. 

SEQ) ID NO. 139 is the predicted amino acid sequence of TbH-32. 

SI:Q ID NO 140 is the predicted amino acid sequence of I bIE33. ' 

Sf Q ID NO: 141-146 are PCR primers used in ihc preparation of a fusion protem 

containing TbRal, 38 kl) and Tb38-E 

SI'Q in NO; 147 is the DNA scc]ucncc of the fusion protein containing 'rbRa3. 38 kD 
and rb38-l. 

ID NO: 1 4S is the ammo acid scqueiiee of the luMini prut cm iMiiilainiri!' I bRa v 
>S kl) and rh^>S-E 

S\XJ ID N( ): [■U) IS the 1)N.\ s-.-quenec ol llie \1. tuberculosis antiuen 'S kD. 
SEQ 11) NO: 15t) is the ammo acid sequence of the M. tubcrcuh^sis antigen 38 kD. 
Si;Q|j)\0: 151 IS the DNA sequence of XP 14, 
Si:n []) NO: E^: is the DNA sectuenee ot XP:4 
SI (J in N( ): EN^ is the DN.\ .sequence o! XP31 
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SHQ ID NO: 158 is the DNA sequence of XP27. 

SEQ ID NO: 159 is the DNA sequence of XP36. 

StQ ID NO: 160 is the 5' DNA sequence of XP4. 

SEQ ID NO: 161 is the 5^ DNA sequence of XF5. 
5 SEQ ID NO: 162 is the 5^ DNA sequence of XP17, 

SIiQ H) NO: 163 is the 5^ DNA sequence of XIMO 

SEQ ID NO: 164 is the 5^ DNA sequence of XP2. 

SEQ ID NO: 165 is the 3^ DNA sequcFice of XP2 

SEQ ID NO: 166 is the 5' DNA sequence of XP3. 
10 SEQ ID NO: 167 is the 3^ DNA sequence of XP3. 

SEQ ID NO: 168 is the 5' DNA sequence of XP6. 

Sf Q ID NO: 169 is the 3^ DNA sequence of XP6. 

SfQ ID NO: 170 is the 5^ DNA sequence of XPI8. 

SEQ ID NO: 171 is the 3^ DNA sequence of XPl 8. 
15 Sf 0 ID NO: 172 is the 5^ DNA sequence of XP19. 

SE.Q ID NO: 173 is the 3' DNA sequence of XP19. 

SI-:Q id NO: 174 is the DNA sequence orXP22. 

SI :Q id NO: 175 is the V DNA sequence of XP22. 

SI Q ID NO: 176 is the 5' DNA sa|ueiRe ofXP:^^ 
2i) SI 0 ID NO: 177 is the ^" DNA scqueiKc ot XP2^ 

SI ID NO: 17S is ihc tuil-Icnulli DNA scqik-nre n\ j h! 14-XPI . 

S\ O ID ^;()- 1 "^0 IS the pivJictcd :iniiiu^ .wid ;.cquencL' oi l hi 14-XlM, 

SI (J ID N(): ISO is the predicted amino acid sequence encoded h\' the reverse 

lonipicnient of rh[i4-XPl. 
2.^ SK,) ID N'O: IS] is a fii^t pivdietcd ainino acid sequence encoded t\\ XP3f>, 
(.) f!) \'n 1 . ^ ' . I, • : : ■ ■ ■ 



■■■■■ : f ,> ■ M ^ ^ ■ ■^ ^ : . ■ " ' ^: i 

^ I (,» i I ) V ■ * A ' ■ ;: ■ 1 ■\ . .\:!.e;u ■ ■ ' ' i ^'1 
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SEQ ID NO: 186 is the DNA sequence of IIDIF8. 

SEQ ID NO: 187 is the DNA sequence of RDIF i 0. 

SEQ ID NO: 188 is the DNA sequence of RDIF 1 1. 

SEQ ID NO: 1 89 is the predicted amino acid sequence of RDIF2. 

SEQ ID NO: 190 is the predicted amino acid sequence of RDIF5. 

Sl:Q ID NO: 191 is the predicted amino acid sequence of Rnn-^>^. 

SF:Q id NO: 192 is the predicted anuno acid sequence of RDIFIO. 

SliQ ID N( ): 193 is the predicted amino acid sequence of RDIFl 1 . 

SEQ ID NO: 194 is the 5' DNA sequence of RDIF 1 2. 

SE.O ID N(»: 195 is the 3' DNA sequence of RDIF 12. 

SEQ ID N( >: ] 96 is the DNA sequence of RDIh7. 

SI'Q ID NO: 197 is the predicted amino acid sequence of RDII'7. 

SFQIDNO: 198 is the DNA sequence of DIF2-I . 

SI:Q ID N(): 1 99 is the predicted amino acid sequence of DIF2-1 . 

SEQ ID NO: 200-207 are PC^R pnmcrs used in tlie prepciration ol a fusion protein 

containing I bRa3, 38 kl), rb38-] and DPI'P (hereinafter rcierred to as TbF-2). 

SI'Q ID N( ): 208 is the DNA sequence of the fusion protein 'rbI-2. 

Sl 'O ID N( >: 209 is the annuo acid sequeuee of the fusion protein l'bF-2. 

[)l I \[! ! !) !)] S( RIPilDN ()\ liii iWIMlON 

As noted abo\'e. the present iiu'cntion is ijenerall}' directed to coinpositu^is 
and methods for diaLinosuig lubereuh)sis. I he eompositions of the subject itnention include 
pt>l>pepiides liuit et>mprise at least one auliueiuc portion ot~a .1/ lui\'rcul()si\ antiijen. or a 
\aMani o! m:Ji an aiiliueii that dilteis onl\ ni eonserxatne subslitutu>ns ;ind or moditieations 
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a polypeptide comprising an antigenic portion of one of the above antigens may consist 
entirely of the antigenic portion, or may contain additional sequences. The additional 
sequences may be derived from the native M tuberculosis antigen or may be heterologous, 
and such sequences may (but need not) be antigenic. 
5 An "antigenic porlion" of an antigen (which may or may not be soluble) is a 

portion that is capable of reacting with sera obtained from an M tuhc'rculosis-micci^iX 
individual {i.e.. generates i\n absorbancc reading with sera from infected individuals that is at 
least three standard deviations above the absorbancc obtained with sera from uninfected 
individuals, in a representative liLISA assay described herein). An "A/ ////)£^;rw/r>.v/.v -infected 

10 individual" is a human who has been infected with A/, tuberculosis (e.^., has an intrademial 
skin test response to PPD that is at least 0.5 cm in diameter). Infected individuals may 
display symptoms ol~ tuberculosis or may be free of disease symptoms. Polypeptides 
comprising at least an antigenic portion of one or more M. tuberculosis antigens as described 
herein may generally be used, alone or in combination, to detect tuberculosis in a patient. 

15 I'he compositions and methods of this invention also encompass variants of 

the above polypeptides. A "variant." as used herein, is a polypeptide that diifers iVom the 
native antigen only in conservative substitutions and/or modifications, such that the antigenic 
properties of the polypeptide arc retained. Such variants may generally be identified bv 
modifying one of the above poI\ peptide sequenees. ;iiK] ev aluatiDi: liie aniigen'ie pinjvrties o\ 

.-0 the modified pol>'pepIule using, lor example, tlie leptesenlatn e pKK'edures dcsei ibed herem. 

■\ "con>er\'aii\ e siibsliluUon" is one in which an ammo acid is snhstituted lor 
another ammo aeid tiiat has similar [iroperties, such that one skilled in the art of peptide 
ehennstiy would expect the secondan' structure and h\ dn>pathie nature of the [^olvpeptide to 
he siihstantialU iinehangx'd In general, the IoIIowihl' groups of ammo aeuls rvpresenl 
coii^ei value changes ( 1 ) ala. puf, , I'hi. aNp. gin. asn, ser. ihr: i ) e\'s. ser. \\ \\ tlii'. ' \ a' 
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Iranslalionally or post-translationally directs transfer of the protein. Fhc polypeptide may 
also be conjugated to a linker or other sequence for ease of synthesis, purification or 
identification of the polypeptide (e.g., poIy-His), or to enhance binding of the polypeptide to a 
solid support. For example, a polypeptide may be conjugated to an immunoglobulin Fc 
5 region. 

In a related aspect, combination polypeptides arc disclosed. A "coinbiiicilion 
polypeptide" is a polypeptide comprising at least one of the above antigenic portions and one 
or more additional antigenic M (uherciilosis sequences, which are joined via a peptide 
linkage into a single amino acid chain. The sequences may be joined directly {i.e., with no 

10 intervening ammo acids) or may be joined by way of a linker sequence (e.g., Gly-Cys-Cily) 
that docs not significantly diminish the antigenic properties of the component polypeptides. 

In general, M. tuberculosis antigens, and DNA sequences encoding such 
antigens, may be prepared using any of a variety of procedures. For example, soluble 
antigens may be isolated from M tuberculosis culture filtrate by procedures known to those 

15 of ordinar}' skill in the art, including timon-exchangc anti reverse phase chromatography. 
Purified antigens may then be evaluated for a desired property, such as the ability to react 
with sera obtained from an A/ tuberculosis-\\\i^<tc{<:d indixidual. Such screens may he 
perfonned using the representative metht)ds described herein. Antigens may tlicn be partially 
sequenced usiuu. lor example, ti-aditiotial I'dmaii cheinisirv. ^sVt' I'dnian and lieri', /■';//■ ./ 

20 lUochcm S(rA lO l h)h7. 

Aiitii^cits tii<iv uIa' W produced rccoinhinaiUl v umul: a DN.A sequence tliat 
cncodo the iiiitigcn. wlncii has been mscrtcd mto an expression \ ectv>r and expressed m an 
appiopiiate host I)\ \ molecules encodmg soluble antigens ma> be isolated [^\' screening an 
afiprofMialc M (uiicrculosis L^xprcssion librarv with anli-scra d ;/ . raf^bit) raised spcciilcallN 

?^ aiMinsl solu'hlc \/ luhcrmlt nis an^lL^el)^ UNA si-qnciu cs riieiHliiii' antigens thai ma\ o\ nia\ 
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DNA sequences encoding soluble antigens may also be obtained by screening 
an approprialc M. tuberculosis cDNA or genomic DNA library for DNA sequences that 
hybridize to degenerate oligonucleotides derived from partial amino acid sequences of 
isolated soluble antigens. Degenerate oligonucleotide sequences for use in such a screen may 
be designed and synthesized, and the screen may be performed, as described (for example) in 
Sambrook et aL, Molecular Cloning: A Lahoratory Manun/^ Cold Spring Harbor 
Laboratories, Cold Spring Harbor, NY (and references cited therein). Polymerase chain 
reaction (PCR) may also be employed, using the above oligonucleotides in methods well 
knowii in the an, to isolate a nucleic acid probe from a cDNA or genomic library. The library 
screen may then be performed using the isolated probe. 

Regardless of the method of preparation, the antigens described herein arc 
"antigenic." More specifically, the antigens have the ability to react with sera obtamed from 
an M. fuhcrculosis-iniccicd individual. Reactivity may be evaluated using, for example, the 
representative FJJSA a.ssays described herein, where an absorbruicc reading with sera from 
infected individuals that is at least three standard deviations above tiie absorbance obtained 
with sera from uninfected individuals is considered positive. 

Anligeiiic [)ortions uf \f tuberculosis antigens may be prepared and identitlcd 
using well known techniques, such as those sumrnari/ed in PaLii, I-'wukimcutuI lmnnuu)lo<^\\ 
nl od., Ra\en Press, pp 24.^^-^47 and references cited therem. Such techniques include 

screenmii polypeptide [Portions of the nati\e antisjen lor antigenic properties | he 
icpresentalive I I fS.Xs Jescriix'd herein nia\ generally he employed in these screens. .\n 
aiUigenic portion oi a pohpeptide is a porticMi tliat, within such representative assa\'s. 
generates a signal m such assays that is substantial!>' similar to that generated In* the full 
Icnyth antii'cn, hi other words, an atUigcinc portion of a M ful\i\ kIoms aiUiLien iiencratcs at 
leas! ahoii; .^0".,. ami iMvferaM\ about ot the Mgnal induced h\' the full leiu'tfi aniircn 
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commercially available solid-phase techniques, such as the Merrifield solid-phase synthesis 
method, where amino acids are sequentially added to a growing amino acid chain. See 
Merrifield, ./ Am. Chem. Soc. (V5:2I49-2146, 1963. Hquipmcnt for automated synthesis of 
polypeptides is commercially available from suppliers such as Applied BioSystems, Inc., 
5 Foster City, CA, and may be operatcci according to the manufacturer's instructions. Variants 
of a nalive artigc'n may generally be prepared using standard mutagenesis techniques, such as 
oligonuclcotide-uiiccied siie-specilic mutagenesis. Sections of the DNA sequence may also 
be removed using st;indard techniques to permit preparation of truncated polypeptides. 

Recombinant polypeptides containing portions and/or variants ol" a native 

10 antigen may be readily prepared from a DNA sequence encoding the polypeptide using a 
variety of techniques well known to those of ordinar\' skill in the art. For example, 
supernatants from suitable host/vector systems which secrete recombinant protein into culture 
media may be tlrst concentrated using a commercially available filter. I-'ollowing 
concentration, the concentrate may be applied to a suitable purification matrix such as an 

15 affinity matrix or an ion exchange resin. Finally, one or more reverse phase IIPLC steps can 
be employed to further purify a recombinant protein. 

Any t^f a variety t)f cxfircssion vectors known to tliosc of ordinary skill in the 
art may he employed to express recombinaiU fiulypeptides as described herein, lixpression 
may be achieved in an\' appropriate host cell that has bciMi transformed or transiected with an 

20 e\pressu>n \ color oontaitiinti a DNA molecule that eiicocles a recombinant poh'pcptide. 
Suitable host cells ihcIikIc prokaivote^;. vca^t and Inehcf cukar\(^t!c cells. [*rclcrabl\, the host 
cells emploved are I: cnl;, \cast or a tuaninialian cell line, such as COS i)\ e'llo, I he DNA 
sequences expressed in this manner may encode naturall}' occurring antigens, portions of 
naturall}' occurrini: antigens, or other \ ariants thereof 

In general. rcL:ai\ilcss ot tlic iiiciIiolI o!' preparation, the [^oK pcpndcs disclosed 
liereni >iir iiropaieii ni siil^st.inti.ilb. t^'" - t(w.. tv i ' ' 
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In certain specific embodiments, the subject invention discloses polypeptides 
comprising at least an antigenic portion of a soluble M. tuberculosis antigen (or a variant of 
such an antigen), where the antigen has one of the following N-terminal sequences: 

(a) Asp-Pro- Val-Asp-Ala-Val-Ile-Asn-Thr-Thr-Cys-Asn-Tyr-Clly-Gln- 
5 Vai-Vai-AIa-Ala-Leu (SI:Q ID NO: II 5), 

(o) Ala-Val-Glu-Ser-Gly-Mct-Leu-Ala-Leu-GIy-'ihr-Pro-Ala-Pro-Ser 

fSHQIDNO: 116); 
(c) Ala-Ala-Met-Lys-Pro-Arg-rhr-(iIy-Asp-Cily-Pro-Leu-Glu-Ala-Ala- 
Lys-Glu-Cily-Arg (SEQ ID NO: 117); 
10 (d) I\T-ryr-Trp-Cys-Pro-Gly-Ciin-Pro-Phe-Asp-Pro-Ah^'rrp-Giy-Pro 

(SI^Q ID NO: 118); 

(c) Asp-Ilc-Gly-Scr-(;hi-Scr-'rhr-Cilu-Asp-Ghi-Gh>Xaa-Ala-Vai (S1:Q ID 
NO: 119); 

(0 A]a-Glu-(ih]-Ser-lIe-Scr-Thr-Xaa-(ilu-Xaa-I]c-VaMVo (SF'Q ID 
15 NO: 120); 

(g) Asp-Pro-(Jki-Prt>-Ala-Pro-Pro-Val-Pr(>'rhr-Thr-Ahi-A]a-SLT-Pro-Pri> 

Scr(SF.Q ID NO: 121); 
Ui) Ala-Pro-I ys-Thr-'Iyr-X;ia-(iIu-(iki-ixHj-Pys-(iIy-Thr-Asp-Thr-C)iy 

(Sf -O ID NO: 122): 

20 (!) Asp P[-(^ Ala Scr Ala Pro ■ \^pA'al-pI<^- i hr Ala- Ala-( iliiA rlii- Thr-Scr- 

! cii ! cu- \:.;i SvM-i cti-.Ma .\->p Pi*^ .Ami \ al Set l*hc Ala Asii (SliU 
ID NO: i:.^). 

( j) Xaa-Asp-Scr-(iiu-L>s-Scr-AIa-l hi-Ilc-I ysA aP 1 hr-Asp-Ala-Scr: 
(Sl (:) If) \.{ ) 12^)t 

.^^ (k( Ala-( rl_N- Asp ! li:-\a,t- III'- I vi -I lc-\ aPt ,i \ -Ami-1 cu-I hr-A la-Asp, 
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amino acid sequence of which is provided in SliQ ID NO: 53. A DNA sequence encoding 
the antigen identified as (a) above is provided in SI-Q ID NO: 96; its deduced amino acid 
sequence is provided in SEQ ID NO: 97. A DNA sequence corresponding to antigen (d) 
above is provided in SHQ ID NO: 24, a DNA sequence corresponding to antigen (c) is 
5 provided in SEQ ID NO: 25 and a DNA sequence corresponding to antigen (I) is disclosed in 
SI:0 ID NO: 94 anti it<; deduced ainino acid sequence i> provided in SI^O ID NO: 95. 

In a further specific embodiment, the subject invention discloses polypeptides 
comprising at least an immunogenic portion of an A/ tuberculosis antigen having one of the 
following N-terminal sequences, or a variant thereof that differs only in conservative 
1 0 substitutions and/or modifications: 

(m) Xaa-'l\T-ilc-Ala- ryr-Xaa- rhr-l hr-Ala-Cilydlc-Val-I^ro-Cily-I.ys-lle- 
Asn-Val-IIis-Lcu-Val; (SEQ ID NO: 132) or 

(n) Asp-i^ro-Fro-Asp-Pro-His-(]ln-Xaa-Asp-Met-Thr-Lys-Gly-Tyr-Tyr- 
15 Pro-(}ly-Gly-Arg-Arg-Xaa-Phc; (SPIQ ID NO: 124) 

wherein Xaa may be any amino acid, preferably a cysteine residue. 

In other specific embodiments, the subject invention discloses polypeptides 
comprising at least an antigenie portion of a soluble M luhvrculosis antigen (or a variant of 
such an aiUigen) th.at a>mpnscs one or more of the aniiiu^ aciJ sequences encoded b\ (a) the 
DNA sequences ot Si (j II) NOS: 1. 1 -1 K). l.V^s. y\ 04 and ^)h, fh)tiie complements o\ 
such DNA sequences, or (c) DNA sequences substanlialK homoloi^ons to a sequence in (a) 01 
(b). 

in further specific einhodmicnts. the subject ]n\'ention disch>ses polvpefituies 
ciuiiprisuig al Icasl an aulitn-iiic porMou of a 1/ iuhcrc}ih).\is anliiien (or a \ ariant t'f Mu h an 
2."^ anuucii), winch nia_\ o{ nww not he sohihlc, that cdmiMiscs oin- uinic t'^- . > ■ ; 
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or more of DNA sequences specifically recited herein. "Substantial homology," as used 
herein, refers to DNA sequences that are capable of hybridizing under moderately stringent 
conditions. Suitable moderately stringent conditions include prewashing in a solution of 5X 
SSC, 0.5% SDS, 1.0 mM HDTA (pM 8.0); hybridizing at 50°C-65°C, 5X SSC, overnight or, 
in the evenl of cross-species homology, at 45°C with 0.5X SSC; followed by washing twice 
at 65°C foi 20 mmutes with each of 2X, 0 5X and 0.2X SSC containing 0.1% SDS). Such 
hybridizing DNA sequences are also within the scope of this invention, as arc nucleotide 
sequences that, due to code degeneracy, encode an immunogenic polypeptide that is encoded 
by a hybridizing DNA sequence. 

In a related aspect, the present invention provides fusion proteins comprising a 
Hrsi and ii second inventive polypeptide or. alteniativcly, a polypeptide of the present 
invention and a known A/ tuhercidosis antigen, such as the 38 kD antigen described above or 
l-:SA'l -6 (Sl^Q ID NOS: and 99), together with variants of such fusion proteins. The 
fusion proteins of the present invention may also include a linker peptide between the first 
and second polypeptides. 

A DNA sequence encoding a fusion protein of the present invention is 
constructed using known recombinant DNA techniques to assemble separate DNA sequences 
encoding the first and second polypeptides into an appropriate expression vector. The 3' end 
of a DNA sequence enccxhng t!ie first pi)l\ peptide is liijated, with or without a pe[niLlc linker, 
to the 5' end (>t a DNA sequence encodiiii: the seeoiul poKpeptide so that the reaciing frames 
of the sequences arc ni phase \o permit mi(NA iiaii:.Iatiiiii of the t^^o DNA sequences into a 
siiM'ie fusion protein that retains the [biological acti\'it\ o\ both the first and the second 
[ioly [Peptides. 

/\ pe[^tide linker sequence iiia\ he eniiiK>\ed to sepaiate the first and the 
second poUpefnides by a distance Miflicieiit lo ensure llial each [loKpeplidc loMs iiiU) its 
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or charged residues that might react with the polypeptide functional epitopes. Preferred 
peptide linker sequences contain Gly, Asn and Ser residues. Other near neutral amino acids, 
such as Thr and Ala may also be used in the linker sequence. Amino acid sequences which 
may be usefully employed as linkers include those disclosed in Maratca et al.. Gene 40-39-46^ 
5 1985, Murphy eta!., Proc Natl Acad. Sci. USA <*^J:82.58-8562, 1986; U.S. Patent 
No. 4,935,233 and U.S. Patent No. 4,751,180. The linker sequence may be from 1 to about 
50 amino acids in length. Peptide linker sequences are not required when the first and second 
polypeptides have non-essential N-tenninal amino acid regions that can be used to .separate 
the functional domains and prevent steric hindrance. 

10 In another aspect, the present invention provides methods for using the 

polypeptides described above to diagnose tuberculosis. In this aspect, methods are provided 
for detecting A/, tuhercidosis infection in a biological sample, using one or more of the above 
polypeptides, alone or in combination. In embodiments in which multiple polypeptides are 
employed, polypeptides other than those specifically described herein, such as the 38 kl) 

15 antigen described in Andersen and H ansen, Itifcct. Inwiwi. 5/;2481-248S, 1 989, may be 
included. As u.sed herein, a "biological sample" is any antibody-containing sample obtained 
from a patient. Prelerabl}, the sample is whole blood, sputum, scrum, plasma, saliva, 
cerebrospinal lluid or urine. More preferably, the sample is a blood, serum or plasma sam[ile 
obtained Irom a patient or a h\oo.\ supply [he ptilype[Miile(s ) are used in an assa\-. as 

2') described belo\^ . to deternune tlie presence or absence of aniibodics lo the poK pcrl ide( s ) m 
the sample. ie!al:\c to a ["cdcteriniiied eul o\\ \ahie. i he presence ol such antibodies 
indicates [m\:\iou> sen>ili/ation to nn cobacierial antigens uhich mav be indieati\'e of 
tuberculosis. 

In L-niboJiments iii which nun'c than one j^oUpcfMide is employed, the 
."^^ pol_\ Pv-plid^sN Used .iir pri-Kaabl\ coioplciiieniai \ one coniptment po)\'peptide will tend 
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formulated that are capable of detecting infection in most, or all, of the samples tested. Such 
polypeptides arc complementary. For example, approximately 25-30% of sera from 
tuberculosis-infected individuals arc negative for antibodies to any single protein, such as the 
38 kD antigen mentioned above. Complementary' polypeptides may, therefore, be used in 
5 combination with the 38 kl) antigen to improve sensitivity of a diagnostic test. 

^rhere are a variety of assay formats known to those of ordinaiy .skill in the art 
for using one or more pt)lypeptides to detect antibodies in a sample. Scc\ e.g.^ Harlow and 
Lane, Antibodies: A Lahoraiory Manual. Cold Spring Harbor Laboratory, 1988, which is 
incorporated herein by reference. In a preferred embodiment, the assay involves the use of 

10 polypeptide immobilized on a solid support to bind to and remove the antibody from the 
sample. The bound antibody may then be delected using a detection reagent that contains a 
reporter group. Suitable detection reagents include antibodies that bind to the 
antibody/poiypcptide complex and free polypeptide labeled with a reporter group (c*.^., in a 
semi-competitive assay). Alternatively, a competitive assay may be utilized, in which an 

15 antibody that binds to the polypeptide is labeled with a reporter group and allowed to bind to 
the immobilized antigen after incubatitMi of the antigen with the sample. The extent to which 
components of the sample inhibit tiie binding of the labcletl antibody to the polypeptide is 
indicative of the reactivity of the sample wilh the iinniobilized polypeptide 

The solid support nia^ he an\ solid niaienal known \o those tM' ordinar\- skill 
in the at1 to whieh the ;itUigen m:i\- be atlaehed. l or example, the solid support ma\ he a test 
well in a nueioliter plate oi a nilrocelluit^se or titlier suitable membrane. .Mlcmainvlv, the 
support ma\ be a bead or disc, such as glass, fiberglass, latex or a plastic material such as 
polystyrene or pol> \ inylchlonde. The support may also be a magnetic particle or a fiber 
optic sensor, such as tliose disclosed, lor exaitipie. in I ',S. Patent No. >.,\*^^^f>S I . 

^-■^ I lie [^ol\ peptuies [na\ be bouiui lo tiie solid support usnm a \ ariel\ ol 
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membrane is preferred. In such cases, adsorption may be achieved by contacting the 
polypeptide, in a suitable buffer, with the solid support for a suitable amount of time. The 
contact time varies with temperature, but is typically between about 1 hour and 1 day. In 
general, contacting a well of a plastic microliter plate (such as polystyrene or 
5 polyvinylchloridc) with an amount of polypeptide ranging from about lOng to about 1 |Lig, 
and preferably about 100 ng, is sufficient to bind an adequate nmount nf antigen. 

Covalent attachment of polypeptide to a solid support may generally be 
achieved by first reading the support with a bifunctional reagent that will react with both the 
support and a functional group, such as a hydroxyl or amino group, on the polypeptide. For 

10 example, the polypeptide may be bound to supports having an appropriate polymer coating 
using benzoquinone or by condensation of an aldehyde group on the support with an amine 
and an active hydrogen on the polypeptide (scc\ c.i^., Pierce Immunotechnology Catalog and 
Handbook, 1991, at A12-A13). 

In certam embodiments, the assay is an en/yme linked immunosorbent assay 

15 (1:LISA). This assay may be performed by first contaetmg a polypeptide antigen that has 
been immobilized on a solid support, commonly the well of a microliter plate, with the 
s.implc, sucli that antibodies to (he polypeptide within the sample are allowed to bind to the 
iinnu)biii/ed polypeptide. Unbound sample is then remo\'eci from the imnidbilized 
pt)lypeplKie and a dcteelion reagent capable (if bindinii id ilie iinniohili/ed antibod\- 

20 polypeptide complex i> added. The amount of detecti(>n reaiicnl tfiat remains hound to the 
solid support in then determineJ using a method appropi'iate for the specific detection reagent. 

More specifically, once the polypeptide is nnmobili/ed on the support as 
described above, the remaimng protein binding sites on the support are tvpicallv blocked. 
An\' suitable blocking agent knttwn to those ol oidinarv skill in the art. siicii as bo\ine scrum 

2,-^ alhumin oi- Iween J^t'" (Signia (■[leniieal ( o,. St. l.ouis. M(»| may he emplove2 Hie 
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of binding that is at least 95% of that achieved at equilibrium between bound and unbound 
antibody. Those of ordinary skill in the art will recognize that the time necessary to achieve 
equilibrium may be readily determined by assaying the level of binding that occurs over a 
period of time. At room temperature, an incubation time of about 30 minutes is generally 
sufficient. 

[Jnbound sample may then be removed by washing the solid sunnori with 
appropriate buftcr, such as PBS containing 0.1% Twecn 20"^'^^ Detection reagent may then be 
added to the solid support. An appropriate detection reagent is any compound that binds to 
the immobilized antibody-polypeptide complex and that can be detected by any of a variety 
of means known to those in the art. Preferably, the detection reagent contains a binding agent 
(such as, for example, Protein A, Protein G, immunoglobulin, lectin or free antigen) 
conjugated to a reporter group. Preferred reporter groups include enzymes (such as 
horseradish peroxidase), substrates, colactors, inhibitors, dyes, radionuclides, luminescent 
groups, fluorescent groups and biotin. I he conjugation of binding agent to reporter group 
may he achieved using standard methods knowai to those of ordinary skill in the art. 
Common binding agents ma\' also he purchased conjugated to a variety of reporter groups 
from many commercial sources {cai., /ynicd Laboratories. San I'rancisco, (/A, and Pierce, 
Rtickford, If ). 

The JctecUori reagent is tlicn incubated \\ilh the inini(ihiii/cd anlil^(Hi\ - 
pt)l> fieptklc conif'jlcx lor an aniounl of nine suffieicnl detcci ihe bound anlihcKK . .An 
a[ipn^pnalc aniouiit ol time nia\ ucneraliv he deternuiKHl Ironi the rnaiiutaclurer's instructunis 
or b\ assa\ [itg the lev el ot binding that oeeuis o\ er a period of time. I 'iiboiUKl detection 
reagent is then retno\'ed and ht>und delection reagent is detected usin^; tlie reporler group. 
Ihe nietfuui L'inplined for detectmi! the repmler grou[^ depends upon the nature ot the 
re[io:tL*! vronji l oi iadu>-ieti\e latMips, inti llalion eounting or autoradiograpfne methods 
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\ o determine the presence or absence of anli-M tuberculosis antibodies in the 
sample, the signal detected from the reporter group that remains bound to the solid support is 
generally compared to a signal that corresponds to a predelcnriined cut-off value. In one 
preferred embodiment, the cut-off value is the average mean signal obtained when the 
5 immobilized antigen is incubated with samples from an uninfected patient. In general, a 
siimplc generating a signal that is three standard deviations above the predeicrinined cut-olt 
value is considered positive for tuberculosis. In an alternate preferred embodiment, the cut- 
off value is determined using a Receiver Opcnitor C'urvc, according to the method of Sackctt 
etal., Clinical Epidemiology. A Basic Science for (finical Medicine, Little Brown and Co., 

10 1985, pp. 106-107. Briefly, in this embodiment, the cut-off value may be determined from a 
plot of pairs of true positive rates (i.e., sensitivity) and false positive rates ( 1 00%-spcciricity ) 
that correspond to eacii possible cut-off value for the diagnostic test result. The cut-off value " 
on the plot that is the closest to the upper left-hand corner (i.e., the value that encloses the 
largest area) is the most accurate cut-off value, and a sample generating a signal that is higher 

15 than the cut-off value determined by this method may be considered positive. Alternatively, 
the cut-oil value may be shifted to the left along the plot, to minimize the false positive rate, 
or to the right, to minimize the false negative rate. In general, a sample generating a signal 
that is higher than the cut-off \'alue determined by this method is considered positive for 
tubciculosis. 

20 In a loiatcd cnibodifiicnt, a^sa\ is p;.'rfornieJ in a rapui tlow-tliroui'h 

stnp k'sl tortnat. wherein tiic antigen is irninnbili/cd on a nienibrane, siirh as nitroci'ihilosc 
In the llow -through lesl. antibodies within llic sample bind to the iminohili/cd poKpeptide as 
tiic sample passes through tlie membranL\ A detection reagent prt)lein .A-colloidal gold) 
then i^inds to the a!Uibod\' pol\pcptKle complex as the solulitMi einUaining tfie detection 
reagent How s through tlie membrane. I he deteclUMi of l^tuind detection reatjent nia\- ihen he 
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detection reagent at that site generates a pattern, such as a hne, that can be read visually. The 
absence of such a pattern indicates a negative result. In general, the amount of polypeptide 
immobilized on the membrane is selected to generate a visually discernible pattern when the 
biological sample contains a level of antibodies that would be sufficient to generate a positive 
5 signal in an I-^LISA, as discussed above. Preferably, the amount uf polypeptide immobilized 
on the membrane ranges from about 25 ng to about 1 |.ig, and more prcicrabh from nhniit 
50 ng to about 500 ng. Such tests can typically be pcribnned with a vcr\' small amount (e.^., 
one drop) of patient serum or blood. 

or course, rmnicrous other assay protocols exist that are suitable for use with 
H) the polypeptides of the present invCiHion. The above descriptions :hc intended to be 
exemplary only. 

In yet another aspect, the present invention provides antibodies to the 
inventive polypeptides. Antibodies may be prepared by any ofa variety of techniques known 
to those of ordinary skill in the ail. .S'c'c\ e.,£;. , llarlow and Lane, Antibodies: A Laboratory 

15 Manual. Cold Spring Harbor Laboratory, 1988. In one such technique, an immunogen 
comprismg the antigenic polypeptide is initially injected into any of a wide variety of 
mammals (cj; , mice, rats, rabbits, sheep and goats). In this step, the polypeptides of this 
invention may scr\'e as the immunogen without modification. .Alternatively, particularly for 
relaliv'cU .short pol\ pL'[^tid(.-s, ;t superit)!" iinimiiK* rcs[><)n^t' ina> be elicited il tlie polNpeptide 

.^n IS itMHcd \o a earner [nolem. siieii as Inuiiie seiuin alliiiiTiin or ke\'ho!e iimpc't hemocN'aniiv 
The imnuiiUM'en is mjceted luie ihe animal h^l^l, preiei^ibi} .Kcoidmi- to a predeterinined 
schedule meorporating one or move booster ininuini/ations, and the animals are bled 
[ieri(HiicaII\ . Polveional antiboLlies specilic tor ihe polypeptide may then be purified from 
such antisera b>, for example. affmil\ elironialoLirapIn usmL' (he poKpeplide coupled lo a 

J'^ Liilable solid support. 
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from spleen eells obtained from an animal immunized as described above. The spleen cells 
are then immortalized by, for example, fusion with a myeloma cell fusion partner, preferably 
one that is syngeneic with the immunized animal, A variety of fusion techniques may be 
employed. For example, the spleen cells and myeloma cells may be combined with a 
5 nonionic detergent for a few minutes and then plated at low density on a selective medium 
that supports the growth of hybrid cells, but not myeloma cells. A prefeireu selection 
technique uses HAT (hypoxanthine, aminoptcnn, thymidine) selection. After a sufficient 
time, usually about 1 to 2 weeks, colonies of hybrids arc observed. Single colonies are 
selected and tested for binding activity against the polypeptide. Ilybridomas having high 

10 reactivity cind specificity arc preferred 

Monoclonal antibodies may be isolated from the supernatants of growing 
hybridoma colonics. In addition, various techniques may be employed to enhance the yield, 
such as injection of the hybridoma cell line into the peritoneal cavity of a suitable vertebrate 
host, such as a mouse. N4onoclonal antibodies may then be harvested from the ascites fluid or 

15 the blood. C\^ntaminants may be removed Irom the antibodies by conventional techniques, 
such as chromatography, gel filtration, precipitation, iuid extraction. The polypeptides of this 
mvcntion may be used in the purification process in, fbi' example, an affinity chromatography 
step. 

.Antihodiivs ina\- he used m diayiioslie tests to detect tiie jiresence ol 
.^0 \f {uhi'fculoMs antii'rns iimim' assa\s similar lo tlid.sc detailed ali(>\'e and niher teelinique.N 
well kiunvn [o those ot .skill ui the att, tluarii} pioMdinr .i iTuaiiod lor dotertins' 
M fuhcrcidosis infection in a patient. 

Diagnostic reagents of the prescdl in\enti(>n nut\ al.so eonqirisc DNA 
sctjuenccs enc(Hliii;^ one or more of the abo\ e pol_\ peptides. <n one or more poitions thereof 
-^-"^ 1 (^r example, at least two oligonucleotide primers nia\ be enipioved in :i poi\rnerase chain 
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present invention may be used in a hybridization assay to detect the presence of an invenlive 
polypeptide in a biological sample. 

As used herein, the term "oligonucleotide primer/probe specific for a DNA 
molecule" means an oligonucleotide sequence that has al least about 80%, preferably at least 
5 about 90% and more preferably at least about 95%, identity to the DNA molecule in question. 
Oligonucleotide primers and/or probes which may be 'jsefull^ employed in the i-nventive 
diagnostic ineihods prelcrably have at least about 10-40 nucleotides. In a preferred 
embodiment, the oligonucleotide primers comprise at least about 10 contiguous nucleotides 
of a DNA molecule encoding one of the polypeptides disclosed herein. Preferably, 

10 oligonucleotide probes for use in the inventive diagnostic methods comprise 'at least about 15 
contiguous oligonucleotides of a D\A molecule encoding one of the polypeptides disclosed 
herein. Techniques for both PCR based assays and hybridization assays are well known in 
the art (see, for example, Mullis ei ul. Ihid\ Ehrlich, Ibid). Primers or probes may thus be 
used to detect A/. tuherculosis-^pcc\r\Q sequences in biological samples. DNA probes or 

15 primers comprising oligonucleotide sequences described above may be used alone, in 
combination with each other, or with previously identified sequences, such as the 38 kD 
antigen discussed above. 

The Ibllowiiiij l Aamplcs arc oltcrvd h\ \\\\\ (^f :lhistrati(Mi and not h\ \\,\\ ol 

?() limitation 

1 X.WlPl.IS 
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M. tuberculosis (either H37Ra, ATCC No. 25177, or H37Rv, ATCC 
No. 25618) was cultured in sterile GAS media at 37°C for fourteen days. 7 he media was 
then vacuum filtered (leaving the bulk of the cells) through a 0.45 filler into a sterile 2.5 L 
bottle. The media was then filtered through a 0.2 fi filter into a sterile 4 L bottle. NaN^ was 
5 then added to the culture filtrate to a concentration of 0.04%. The bottles were then placed in 
a 4°C cold room. 

The culture filtrate was concentrated by placing the filtrate in a 12 L reservoir 
that had been autoclaved and feeding the filtrate into a 400 ml Amicon stir cell which had 
been rinsed with ethanol and contained a 10,000 k[)a MWCO membrane. The pressure was 
10 maintained at 60 psi using nitrogen gas. This procedure reduced the 12 L volume to 
approximately 50 ml. 

The culture filtrate was then dialy/ed into 0.1% ammonium bicarbonate using 
a 8,000 kDa MWCX) cellulose ester membrane, with two changes of ammonium bicarbonate 
solution. Protein concentration was then detemiined by a commercially available RCA assay 
15 (Pierce, Rockford, IL). 

The dialy/ed culture filtrate was then lyophilized, and the polypeptides 
resuspended in distilled watci". The polypeptides were then dialv/ed against 0.01 mM 1,3 
bis|tris(hydroxyrncthyl}-mcthylaminc)lpropanc, pll7.5 {Bis-Tns propane huifer). the initial 
conditions for anion exchariku- chinnKitoiMaphy I-'racUdnation was pcrlormcd using \jx\ 
20 protusitMi cIinMnatoi'rapfu a P( )[\( )S 140 [I luwou exchange colinnii 4.6 mm x 

lOOnini (Perscpti\o i^uiSysicm^, I' r:imi[{i:ii;mi, MAi L-qiHiibKiicd \n 0 (H inM Ris-lris 
propane bulTcr pH 7.5. l\)lypcpudcs were ciuied \Mtli a linear 0-0.5 M NaC'l gradient m the 
above hutTer system. The column ehient was nuMiiuired at a wavelength of 220 nm. 

1 he pools of polvpcjitidcs clutiiiL' troni the \on exclianue column were 
-'■^ dialwcd <iL:a]nst distilled water aiul i\ophili/ed. Ilie resullini' matenat was ilissohed in n.]"„ 
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to maximize the purity of the individual samples. Approximately 200 purified polypeptides 
were obtained. 

The purified polypeptides were then screened for the ability to induce T-cell 
proliferation in PBMC preparations. The PBMCs from donors known to be PPD skin test 
5 positive and whose T ceils w^crc shown to proliferate in response to PPD and crude soluble 
proteins from MTR were cultured in medium comprising RPMI 1640 supplemented with 
10% pooled human serum and 50 f.ig/ml gcntamicin. Purified polypeptides were added in 
duplicate at concentrations of 0.5 to 10 |.ig/niP. After six days of culture in 96-well round- 
bottom plates in a volume of 200 50 \i\ of medium was removed from each well for 

10 determination of Ih'N-y levels, as described below. The plates were then pulsed with 
i faCi/well of tritiated thymidine for a further 18 hours, harvested and tritium uptake 
determined using a gas scintillation counter. [Tactions that resulted in proliferation in both 
replicates three fold greater than the proliferation observed in cells cultured in medium alone 
were considered positive. 

15 ll'N-y was measured using an en/ymc-linkcd immunosorbent assay (I:LISA). 

f LISA plates were coated with a mouse monoclonal antibody directed to human IFN-v 
(Chcinicon) in PBS for four hours at room temperature Wells were then blocked with PBS 
ccmtaininj^ 5% (W'V) non-ial dried milk lor 1 hour at room temperature. The plates were 
llien washed six times m PHS'O.Z^o TWI'TN-ZO aiki samjiles diliitt-d ] ^ in culture medium 

20 in the ivMS.A plates were nicubated (uorniiihl at Kxun temperatur.* The filale> werr auaiii 
washed and a po! wKuial ral^bil aiili-hiinian Il X-y scrum diluted l:,'^n()() m PBS/l()*\i normal 
enat serum was adde*.! to eaeii well. 1 iie [elates were then mcubated for two ht)urs at room 
temperature, washed and horseradish penixidase coupled anti-rabbit li^Ci (.lackson Labs.) was 
acklcd at a 1 :.^()()() Liilution m PBS'.'^"., lUMi-tal dried milk. a further hour incubation 

'It KHuii tcinpctatuie. t!ic [Mates were waslica anJ 1MB substrate added. Mie reaction was 
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For sequencing, the polypeptides were individually dried onto 
Biobrcne^"^ (Perkin Elmer/Applied BioSystems Division, Foster City, CA) treated glass fiber 
filters. The filters with polypeptide were loaded onto a Perkin Ehtier/Applied BioSystems 
Division Procisc 492 protein sequencer. The polypeptides were sequenced from the amino 
terminal and using traditional Edman chemistry. 1 he amino acid sequence was detennined 
for each polypeptide by comparing the retention time of the PTII amino acid derivative to the 
appropriate PTH derivative standards. 

Using the procedure described above, antigens having tlie following 
N-terminai sequences were isolated: 

(a) Asp-Pro-Val-Asp-AIa-Val-Ilc-Asn-l1ir-l1ir-Xaa-Asn-Tyr-Gly-(iln- 
Val-Val-Ala-Ala-Lcu (SKQ ID NO: 54); 

(b) Ala-Val-rTlu-Scr-Cily-Met-Leu-Ala-Lcu-Cily-Thr-Pro-Ala-Pro-Ser 
(SEQ ID NO: 55); 

(c) Ala-Ala-Met-Lys-Pro-Arg-Thr-Gly-Asp-Gly-Pro-Leu-Glu-Ala-Ala- 
Lys-Glu-(ily-Arg (SP:Q ID NO: 56); 

(d) Tyr"Tyr-Trp4'ys-Pr()-(ny-(iln-Pro^Phe^Asp-Pr()-Ala-l'rp~Cily-i^ro 
(Sf;QII)N(): 57); 

(c) Asp-lle-Gly-Scr-Cilu-Scr- rhr-Cilu-Asp-Ciln-Gln-Xaa-AIa-Val fSP;U ID 
\() • 

(!^^ Aia^( rlii-(;hi-Sc! [Ir-Srr- [1ir-\aa-( ilu-Xaa-IlL' A'aNPn^ iSbi,) []) 
\< V 50). 

(U) Asp-ProAihi-Pni-Ala-lM-o-ProA^tl-l^(>-riir-Ahi^Ala~Ala-Ahi-Pn)-Pio-- 

Ala iSi:o in NO: 60); and 
(h) Ala-Pro I vs l lii- - jX r-Xaa-i ihi-( rhi-I ,cn-l vsA ilv- Thr- Asp- I iirA ;i\ 

(SP<,> II) NO: (^\ ): 
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Cily, CA) with a 7 micron pore size, column size 1 mm x 1 00 mm, in a Perkin Elmer/Applied 
Biosystcms Division Model 172 HPLC. Fractions were eluted from the coluimi with a linear 
gradient of 1%/minute of acetonitriie (containing 0.05% TFA) in water (0.05% TFA) at a 
flow rate of 80 |il/minutc. The eluent was monitored at 250 nm. The original fraction was 
separated into 4 major peaks plus other smaller components and a polypeptide was obtained 
which was shown to have a molecular weiL^ht of 12.054 Kd (by mass spcctiunieiry) and the 
following N-terminal sequence: 

(i) Asp-Pro-Ala-Ser-AIa-Pro-Asp-Val-Pro-'rhr-AIa-Ala-(}ln-Ciin-rhr-Ser- 

Lcu-Fcu-Asn-A^yi-Lcu-Ala-Asp-Pro-Asp-Val-Scr-Phe-Ala-Asp (SEQ 

11) NO: 62). 

This polypeptide was shown to induce proliferation iuid II-N-y production in PBMC 
preparations using the assays described above. 

Additional soluble antigens were isolated from M. tuberculosis culture filtrate 
as follows. A/, tuberculosis culture filtrate was prepared as described above. Following 
dialysis agamst Bis-Tris propane buffer, at pH 5.5, fractionation was pcrfonned using anion 
exchange chromatography on a Poros C^l; column 4.6 x 100 mm (Perseptive Biosystcms) 
equilibrated in Bis-'fris propane buffer pll 5.5. Polypeptides were eluted with a linear 0-1.5 
M NaCl gradient in the above buffer system at a How rate of 10 ml/min. I he column eluent 
was nioiiitoied at a vsavclcngtli (^1 214 nm. 

TliL' tractions elutin*' trcMii tb.c ipii cxchanec cohnni^ were pooled and 
subjected to rc\erse phase elironiatoi'.iapliv usiiu' .i i^oios R.^ eohinin 4.f> \ ]{){) inni 
( Persepti\ e }iios\ stems i. Poly peptides were ehited Ironi the eohinin with a linear gradient 
from ()-IOf)''n acetonitriie (0.1 ^> liA) at a (l(nv rate of 5 ml min. I iie eluerU was monitored 
at 214 nm. 

I laetions contamnii: the eluted poK peptuies were i\-opfiil!/ed and re^uspended 
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The fraction with biological activity was separated into one major peak plus 
other smaller components. Western blot of this peak onto PVDF membrane revealed tliree 
major bands of molecular weights 14 Kd, 20 Kd and 26 Kd. These polypeptides were 
determined to have tJie following N-temiinal sequences, respectively; 

(j) Xaa-Asp-Ser-Glu-Lys-Ser-Ala-'rhr-Ile-Lys-Val-Thr-Asp-Ala-Ser; 
(SEQID NO: 129) 

(k) Ala-Gly-Asp-rhr-Xaa-Ile-Tyr-Ilc-Val-Gly-Asn-Leu-Thr-Ala-Asp; 
(SliQin NO: 130) and 

(1) AlaT^ro-Cihi-Ser-Gly-Ala-GIy-Leu-Cily-Cily-'lhrA^al-GIn-Ala-Gly; 
(SHQ ID NO: 131), wherein Xaa may be any amino acid. 
Using the assays described above, these polypeptides were shown to induce proliferation and 
IFN-y production in PBMG preparations. iMgs. lA and H show the rcsiihs of such assays 
using PBMC preparations from a first and a second donor, respectively. 

DNA sequences that encode the antigens designated as (a),.-(c), (d) and (g) 
above were obtamed by screening a A/, tuberculosis genomic library using ^'P end labeled 
degenerate oligonucleotides corresponding to the N-terminal sequence and containing 
M fuhcn ulosis codon bias. The screen performed using a probe corresponding to antigen (a) 
above identified a ch)nc having the sequence pro\'ided in S\\Q ID NO: 06. The polypeptide 
ciicodcii h\ SI'(^) II) NO: ^)C> is pixn'uled in Sl (^) ID NO: The screen pcrlormcd using a 
probe eonvspdndin^j m iinlti'en ([') ;iho\e Klciitilied a clone haMni: the sequence pro\'uied ni 
S!:(^) M) NO: 50. 1 lie f>oi> [lejilide eiieoded b> SIQ II) NO: >2 is [^nnided m 11) 
NO. 5,^v I he screen pei lc^nied usmg a probe corresponding in antigen (d) abo\*e identitled a 
clone having the sequence pn>vided in S\ () ID NO: 24, aiul the screen performetl with a 
probe con*espi>iKhni' [o anlii'cn (e) identitled ;i cUmic having tlie sequence prtuided in S\-.i) ID 
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7 he amino acid sequence for antigen (i) was found to be homologous to a 
sequence from A/ leprae. Tlie full length M. leprae sequence was amplified from genomic 
DNA using the sequence obtained from GHNBANK. This sequence was then used to screen 
an M. tuberculosis library and a full length copy of the M. tuberculosis homologue was 
5 obtained (SHQ ID NO: 94). 

The amino acid sequence i'or antipcn (i) was found to be homologuus to a 
known hi. tuberculosis protein translated from a DNA sequence. To the best of the 
inventors' knowledge, this protein has nt)t been previously shown to possess T-cell 
stimulatory activity, llie amino acid sequence for antigen (k) was found to be related to a 
10 sequence from A/, leprae. 

In the proliferation and lI'N-7 assays described above, using three PPD 
positive donors, tlie results for representative antigens provided above are presented in Table 
1: 

15 1 AHLl-: 1 

Rf.si s Of- PHMC Proi 11 kration and IbN-v Assays 



Sequence \ 

(a) ' 


IVvilifcration 

\ 


II-N-7 




(c) : 


i + 


- + -t- 




(d) i 


4-+ 










+ +--+ 




(h) 1 


+++ 


+++ 





III I al^lc 1, responses that .i stinniiation mdc\ (Sh of hi'tw'ccn and ^ 
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indicate thai these antigens are capable of inducing proliferation and/or interferon-y 
production. 

E XAMPLE 2 

5 Use ()[' IVmii:n r Skra To lr.o\AT\ : A / IVnKRci '/.os/s Antigens 

This example illustrates the isolation of antigens from M. tuhercidosis lysate 
hy screening with scnim from M. tuhcrculosi.s-'miccicd individuals. 

Dessicatcd A/ tuhercidosis H37Ra (Difco Laboratories) was added to a 2% 
10 NP40 solution, and alternately homogenized and sonicated three times. The resulting 
suspension was centrifuged at 13,000 r7)m in microfugc tubes and the supernatant put through 
a 0.2 micron syringe filter. The filtrate was bound to Macro Prep DFAF beads (RioRad, 
Hercules, CA). I'he beads were extensively washed w^ith 20 mM Tris pi I 7.5 and bound 
proteins eluted with IM NaCl. Tlie NaCl elute was dialyzed overnight against 10 inM Tris, 
15 pH 7.5. Dialyzed solution was treated with DNase and RNase at 0.05 mg/ml for 30 min. at 
room temperature and then with aT])-mannosidase, 0.5 IJ/mg at pi I 4.5 for 3-4 hours at room 
temperature. After returning to pll 7.5, the material was fractionated via hPLC over a Bio 
Scale-()-2f) column (HioRad). f-ractious were combined into nine pools, coneentrated in a 
(\'ulri[irvfi 10 ( Amuon, nc\'er]cy. MA) and screened b\' Western blot lor serological acli\it\ 
r'O iismi' a sciLini pooi lioin \t nihi'ri ul()\!s~\n\c<:W\\ patients uliich was not imnuin(M'eacti\'e 
\Mlh other anugen^ of the present iin enlioii 

I he most reacti\e iVaclion was run ni SI)S-PA(i[ aiul tianslened to PVDI . A 
band at approximateU S5 Kd was cut out \ ielding the seL]Lieiice; 

(m) Xaa-'rvr-Ile-A]a-T\r-Xaa^Thr-Thr-,\ia^(i[\-Ile-\'aUPro-(i]\-l.\s-Ile- 
^'5 Ami Val-llisT euA'ai. (S|-(,) ID \'( >: T^J). wherein Xaa ma\ he an\- 
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degenerate oligonucleotides corresponding to the N-tcrminal sequence of SEQ ID NO: 137. A 
clone was identified having the DNA sequence provided in SHQ NO; 198. This sequence 
was found to encode the amino acid sequence provided in SEQ II) NO: 199. Comparison of 
these sequences with those in the gcnehank revealed some similarity to sequences previously 
5 identified in M tuberculosis and M hovis. 

F.XAMPLE 3 

PRtJ^ARATION Ol- DXA S£OUi:NC HS liNCODING A/ Tl IBERClJiOSIS \m\G\i\^?> 

10 This example illustrates the preparation of DNA sequences encoding 

M tuberculosis antigens by screening a hi. fuhcrculosis expression Hbraiy with sera obtained 
from patients infected with A/ tuberculosis, or with anli-scra raised against M. tuberculosis 
antigens. 

15 A . PRr.PARATioN or M. 7f-7im'(;Aas7\ Soli'Bm: ANTiGf:NS u sinc; RAim n ANri-si:R A 

RaISHH ACiAlNST M. JBt- KCl ![X)S1 S Si;Fn:RNATANr 

(rcnoniic DNA was isohitcd from the A/, tuberculosis stram II37Ra. The DNA 
was randomly sheared and used to construct an expression hbrary using the Lambda ZAP- 
expression svstcrn (Stratagenc. I a Jolia, CA). Rabbit anii-scra was roiiL-ralod .irainsl 

20 sccri'tor\ prtncins ot" the A/ tuhcrculo\i\ strams ll.^>"Ra, I137R\ aiul I ithiian by iiiniuini/ing a 
rabbit concentrated Miperiiatant ol the ,\/ luhcrculosis cultures. Sficcificallw tiic rabbn 
was llr.sl ininuiiii/ed siii^eutaneousl}' with 200 ot protein antigen in a total \x)ki[nc o{ 2 ml 
c(MUainmg 1 00 fig tnuramyl dipeptidc ( Calbiocheni, La Jolla. V \) and 1 ml of incomplete 
Lreunds adiii\'aiit l our weeks later the rabl^it was boosted subeutanetnisl)' with lOOpg 

-"^'^ aiitii'en ni nuiMuplele 1 roiiiul\ ail|ii\ant. I iiiail\. llie rabbit was nnnnini/ed intr"a\ enousK 
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Thirty two clones were purified. Of these, 25 represent sequences that have 
not been previously identified in M. tuberculosis. Proteins were induced by IPTG and 
purified by gel elution, as described in Skeiky et al., ./ Exp. Med, /SI :\ 527-1 537^ 1995. 
Representative partial sequences of DNA molecules identified in this screen are provided in 
5 SEQ ID NOS: 1-25. The corresponding predicted amino acid sequences are shown in SEQ 
ID NOS: r)4-S8. 

On comparison of these sequences with knowii sequences in the gene bank 
using the databases described above, it was found that the clones reibrred to hereinafter as 
TbRA2A, TbRAIo, TbRA18, and TbRA29 (SFQ ID NOS: 77, 69, 71, 76) show some 

10 homology to sequences previously identified in Mycobacterium leprae but not* in 
M. tuberculosis. IbRAl 1, TbRA26, rbRA2S ami fbDPEP (SI-Q ID NOS: 66, 74, 75, 53) 
have been previously identified in M. tuberculosis. No significant iioniologics were found to 
TbRAl, TbRA3, IbRA4, rbRA9, fbRAlO, TbRA13, IbRAH, TbRA19, TbRA29, 
TbRA32, TbRA36 and the overlapping clones TbRA35 and TbRAl 2 {SEQ ID NOS: 64, 78, 

15 82, 83, 65, 68, 76, 72, 76, 79, 81, 80, 67, respectively]. 1hc clone TbRa24 is overlapping 
with clone 'I bRa29. 

B Usi- ()[ Si:ra from Patiknts havi\(, Pu[ai()nary or I'lja i kAL Ti'Bf:R(n:i.( )sis to 
I ijirvnrv .ON A jS r( h 'i ;Nn:s I ■ \( ( )|);n'( . A / / / vf/ 7,;,^/ <-)vv AN'ri('; i:Ns 

I he DNA lihrar> dcsL-rihcJ ahcnc. anti an additional n.^>7R\- hbrar\. 

v\cic >ciL"cneti nsitiL' [>ooIs slt^i (tlnaiiKvi [Mlinils w ith a(. Ii\r tuhcivulosis. l o jncparc 

iIk- [I.^>7R\ lihrar\, M iui'crcnloM.^ siraiii I137I\\ gcnonuc DNA was isolated, subtceted to 
partial Sau3A digestKMi and used to consuiicl an expression library using ihc Lambda Zap 
expression system (Strataecne, 1 a Jolla, Ca) Three different pt>ols ol sera, each cuntainiiii' 
2'^ seta ohlained troni diree ;ndi\ idiial,^ \\\\h aelivc piilrnonarv or pleural disease, were used 111 
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lacked increased reactivity with the recombinant 38 kD M. (uherculosis iI37Ra phosphate- 
binding protein. 

All pools were pre-adsorbed with E. coli lysate and used to screen the H37Ra 
and lI37Rv expression libraries, as described in Sambrook et aL, Molecular Cloning: A 
5 Laboratory Manual, Cold Spring Harbor Laboratories, Cold Spring Harbor, NY, 1989. 
Bacteriophage plaques expressing immunoreactive antigens were purifif^H Phagemid from 
the plaques was rescued and the nucleotide sequences of the Ad. tuberculosis clones deduced. 

Thirty two clones were purified. Of these, 31 represented sequences that had 
not been previously identified in human M. tuberculosis. Representative sequences of the 

10 DNA molecules identified are provided in SFQ 1^ NOS:: 26-51 and 100. Of these, TbH-8-2 
(SHQ. ID NO. 100) is a partial clone of TbH-S, and TbH-4 (SKQ. ID NO. 43) and Tb]K4- 
P'WD (SCO, ID NO. 44) are non-contiguous sequences from the same clone. Amino acid 
sequences for the antigens hereinafter identified as Tb38-1, TbH-4, TbH-8, TbH-9, and 
rbM-12 are shown in SHQ ID NOS.: 89-93. Comparison of these sequences with known 

15 sequences in the gene bank using the databases identified above revealed x\o significant 
homologies to TbI 1-4, I bH-H, TbH-9 and fbM-3, although weak homologies were found to 
TblI-9. rbH-12 was found to be homologous to a 34 kl) antigenic protein previously 
identified in M panUuhcrculosis (Ace. N(\S28515). lb3K-] was found to he located 34 
base pairs upslreani of tlio oprii rradini' ii<in^^ for tlic antigen l'SAi-6 prc\ iousl\- klcnlilicd 

?(l in A/ /vMvv (Acc. No. r.UX4K) .nitl in \1 fuhiTHflosis (Sorcnscn et til.. Infci Ininiuti 
<^^i^il)-l^l7. M)9S,. 

Probes dciived tn>m lb3S-l and I hll-'), botli isokited from an H37Ra librarw 
wav used to identilv clones in an H37R\ library. fb3S-l Inf^idi/ed to Tb38-lf2, rb38- 
II v Ib38-ll ^ and fh.^S-H C) (Sl'O. ID NOS: li)-". lOS, 11!,] I V a\k\ 114}. (SI-O II) KOS: 

2*^ ]n7 and lOS arc non-eontiguous sequences Iroin clone 1 b.'^S-ll 2.) i open reading fr<unes 
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ID NO. 105) is a partial clone of TbH-8. The deduced amino acid sequences for these three 
clones arc presented in SEQ ID NOS: 102, 104 and 106. 

Further screening of the M tuherculosis genomic DNA library, as described 
above, resulted in the recovery often additional reactive clones, representing seven different 
5 genes. One of these genes was identified as the 38 Kd antigen discussed above, one was 
determined to be identical to the I4Kd alpha cpy^stallin heat shock nrntcin previously sliowri 
to be present in A/ /ubcrculosi\\ and a third was determined to be identical to the antigen 
TbH-8 described above. The determined DNA sequences for the remaining five clones 
(hereinafter referred to as 'fbH-29, TbII-30, TbII-32 and TbH-33) are provided in SF.Q ID 

10 NO: 133-136, respectively, with the corresponding predicted amino acid sequences being 
provided in SllQ ID NO: 137-140. respectively, 'fhe DNA and amino acid sequences for 
these antigens were compared with those in the gene bank as described above. No 
homologies were found to the 5' end of rbH-29 (wliich contains the reactive open reading 
frame), although the 3' end of TbH-29 was found to be identical to the A/, tuherculosis 

15 cosmid Y227. Tbll-32 and 'fbH-33 were found to be identical to the previously identified 
A/ tuherculosis insertion element iS61 10 and to the /U tuherculosis cosmid Y50, 
resi^eclively. No significant homologies to TbII-30 were iound. 

Positive phagemid from this additional screening were used to infect coli 
XI -1 Bhic MRl''. as described iti Sjinbiook c\ siij^i^i hnliiclinn oi lecomhinaiit protciti 

'^i) wjs acconipiishcd In the addition o\ \V\i\ IikUkwI ;iint ufiitidiuwl K'sates wcir iiin ni 
duplicalc i>i\ Sl)S-P.A(ii- atui trans! erred lo iiitrocclluIo.se fillcis I'lhcis wcfc reacted witli 
luiman St. tuberculosis sera (1:200 dilution) reactive with Tbll and a rabbit sera (1:200 or 
1:250 dilution) reactive \Mth the N-lerminal 4 Fvd ponion cd' lac/. Sera incubations were 
performed for 2 hmirs at idom temperaliire. Mound aiitibod\ v^:ls detceted hy aildition oi' ' '4- 

2.^ labeled Prtttcin .A and >ui\sciiiieiil exposure to tilm t(»r \ariable times r:ini:ini: troni ]h lunir.^ 
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TABLE 2 



Human M. tb Anti-lacZ 
Antigen Sera Sera 

TbH-2« 45 Kd 45 Kd 

TbH 30 Ni) reaciivity 29 Kd 

TbFI-32 12 Kd 12 Kd 

TbFI-33 16 Kd 16Kd 



Positive reaction of the recombinant human M. tuberculosis antigens with both 
the human M. tuberculosis sera and anti-lacZ sera indicate that reactivity of the human A/ 
tuberculosis sera is directed towards the fusion protein. Antigens reactive with the anti-lacZ 
sera but not with the human M. tuberculosis sera may be the resuU of the human M. 

15 tuberculosis sera rccogni/Jng conformational epitopes, or the antigen-antibody binding 
kinetics may be such that the 2 hour sera exposure in the immunoblot is not sufficient.**. 

Studies were undertaken to determine whether the antigens rbH-9 and [ b38-l 
represent cellular proteins or are secreted into .\f tuberculosis culture media. In the first 
study, rabbit sera were raised against A) secrctor\ proteins of A/ tuberculosis, H) the known 

2(> scci*ctor\' tccombmaiU M. lubcrculoMs aiiiiecii S.^b, (') tcconilMnaiU I'fi'^K-l and D) 
tcciunbuianl Ibll usiiii^ i^rotoiois sulisunUiallv as described ni f \ani[dc .vA [tatal ,\/ 
lubercnlosis lysate. concentrated supernatant of A/ (Nhcrcubi^is cuilurcs and the reconibinani 
antigens S5b. i bll 9 and 1 b3(S 1 \^ere rcsoUed on denaturing gels, immobili/ed on 
nitrocellulose membranes and duplicate blots were probed usmii the rabbit sera described 

The results of tli!^ anab<is M^!iii' c>M^'r>d I ( T^n. M .m^- 
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residues and would therefore be expected to migrate vvitli a mobility approximately 1 kD 
larger that the native protein. In Figure 2D, recombinant TbM-9 is lacking approximately 10 
kD of the full-length 42 kD antigen, hence the significant difference in the size of the 
immunoreactive native TbH-9 antigen in the lysatc lane (indicated by an arrow). These 
3 results demonstrate thai Tb38-1 and TbH-9 are intracellular antigens and are not actively 
secreted by M. tiibercidosis. 

Ihc finding that TbH-9 is an intracellular antigen was confirmed by 
determining the reactivity of 1 bl 1-9-spccific humaji T cell clones to recombinant TbH-9, 
secretor)' hi tuberculosis proteins and PPD. A l'bH-9-specific T cell clone (designated 

10 131TbII-9) was generated from PBM(' of a healthy PPD-positivc donor. The proliferative.' 
response of 131TbH-9 to secretory proteins, recombinant TbH-9 and a control A/. 
tuberculosis antigen, TbRal 1, was determined by measuring uptake of tritiatcd thvniidine, as 
described in I^xample I. As shown in l-igure 3 A, the clone I3ITbH-9 responds specifically 
to TbH-9, showing that TbH-9 is not a significant component of A/, tuberculosis secretory 

15 proteins, figure 3B shows the production of IfN-y by a second 'I'bH-9-specific 'f cell clone 
(designated PPI) 800-10) prepared from PBMC from a healthy PPD-positivc donor, 
ibllowmg stimulation of the 1 cell clone w ith secretory proteins, PPD or recombinant TbH-9. 
riiesc results fuilher confirm that TbH-9 is not secreted by M tuberculosis, 

(_ [si (.)} S\-]( A fKOM PaTI! MS HA \'l\(i f,\IR\[M'[ MO NAK V I I'l U !vf 'JM /A"^ ^ I^' I DIM \]\ 

( icnomic DNA was isolated from \f (uherculoMs fidman strain, landomh 
sheared and used to coristruct an expression iibr;u\ emphumg the 1 ambda /AP expressidn 
2."^ sNstem (Sirataeene. fa Jolla, I he lesultme lihiar\ wa^ sereetied usuii' pools dl sei'a 

ohtameJi tr(Mn nuiieiJiiaU with e\tra!vahnon;ti\ fif^-r/'ih ^ :i ■ ' ■ ' ■■ ■ ' 
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153, respectively, with the 5' and 3' DNA sequences for XP32 being provided in SEQ ID 
NOS: 1 54 and 1 55, respectively. The predicted amino acid sequence for XP14 is provided in 
SEQ ID NO: 156. The reverse complement of XP14 was found to encode the amino acid 
sequence provided in SEQ ID NO: 157. 
5 Comparison of the sequences for the remaining 14 clones (hereinafter referred 

to as XPI-XP6, XP17-XP19, XP22, XP25, XP27, XP30 and XP36) with those in the 
gcnchaiik as described above, revealed no homologies with the exception of the 3' ends of 
XP2 and XP6 which were found lo hear sonic homology to known M. tuberculosis cosniids. 
The DNA sequences for XP27 and XP36 arc shown in SEQ ID NOS: 158 and 159, 

10 respectively, with the 5' sequences for XIM, XP5, XP17 and XP30 bemg shown in SEQ ID 
NOS: 160-163, respectively, and the 5' and 3' sequences for XP2, XP3, XP6, XPI8, XP19, 
XP22 and XP25 being shown in SEQ ID NOS: 164 and 165; 166 and 167, 168 and 169; 170 
and 171; 172 and 173; 174 and 175: and 176 and 177, respectively. XPl was found to 
overlap with the DNA sequences for l^bH4, disclosed above. The full-length DNA sequence 

15 for rbH4-XPI is pfcvided in SEQ ID NO: 178. This DNA sequence was lound to contain an \ ^ 
open reading frame encoding the amino acid sequence shown in SEQ ID NO: 179. fhe 
reverse complement ol'TbII4-XPl was found lo contain an open reading frame encoding the 
amino acid sequence shown m SI'Q 11) NO: ISO. The DN.X sequence \\n XP36 was found to 
contain two open re;idini' frames cncoJinL^ the jmino acid sequence sho\\ii m Sl (,) 11) NOS: 

,^0 I XI and 1 S2. uilh t[ie [everse >niplenient cnntamiiiL' .iii dpcn readmi- frame encodinu tlie 
aniiiui acid sequence shnw ii m SI (^HI) N( ) IX > 

Recoiv.binanl XPl [iioteiii was piepaied a^ dociihL'd aht>\'e m f xample 
with a metal ion aitimtv ehroinaiography cokimn fieinu employed fni jint ilication 
Recombinant X[*l was lound to snmulate cell proliferation and !f \-v production in 1 cells 

25 isohited from an \/ /;//\7'e;v/( m/s 'innume donors. 
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serological activity with a serum pool from M. f nherculos is-infccXcd patients which showed 
little or no immuiioreactivity with other antigens of the present invention. Rabbit anti-sera 
was generated against the most reactive fraction using the method described in Example 3A . 
The anti-sera was used to screen an M. tuberculosis Erdman strain genomic DNA expression 
5 library prepared as described above. Bacteriophage plaques expressing immunoreactive 
antigens were purified. Phagemid from the plaques was rescued and the nucleotide sequencer 
oi the M tuberculosis clones determined. 

Ten different clones were purified. Of these, one was found to be TbRa35, 
described above, and one was Ibund to be the previously identified A/, tuberculosis antigen, 

10 HSP60. Of the remaining eight clones, six (hereinafter referred to as RI)IF2, RI)IF5. RI)IF8, 
RDIFIO, RDFFT 1 and RnTFI2) were f^und to bear some similarity to previously identified- 
A/, tuberculosis sequences, fhe detcnnined DNA sequences for RD1F2, RDII'5, RDIF8, 
RDIFIO and RDIFll are provided in SHQ II) NOS: 184-188, respectively, with the 
corresponding predicted amino acid sequences being provided in SEQ ID NOS: 189-193. 

15 respectively. \he 5" and 3^ DNA sequences ibr RDII'12 are provided in SFX> ID NOS; 194 
and 195. respectively. No significant homologies were found to the antigen RDlF-7. The 
determined f)NA and predicted amino acid sequences for RDIF7 are provided in SItQ ID 
NOS: 196 and 197. respectively. One additional clone, referred to as RDIF6 was isolated, 
liowcver. this was found to he identical to 

20 Rcconihinaiu KDIKk KDIfS. KDlilO and RDIFM were prenared as 

desciilu'd .ib()\L' aiiiii'ci:.^ wl-ic fviuiid >liiiiuiaic ccli proh teratioii aiui IfN-y 

[nnLhictitin HI I c^dN isolated lioin \/ (W/Zr^s/.v-inimune donors. 
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PPD was prepared as published with some modification (Seibert, F. et al.. 
Tuberculin purified protein derivative. Preparation and analyses of a large quantity for 
standard. I hc American [Review o f Tuberculosis 44 :9-25, 1941). A/, tuberculosis Rv strain 
was grown for 6 weeks in synthetic medium in roller bottles at 37°C. Bottles containing the 
5 bacterial growth were then heated to lOOX' in water vapor for } hours. Cultures were sterile 
filtered using a 0.22 ft illter and the liquid phase was concentrattnl 20 times using a 3 kD cui- 
off membrane. Proteins were precipitated once with 50% ammonium sulfate solution and 
eight times with 25% aninioniuni sulfate sohiti(^n. The resulting proteins (PPD) were 
fractionated by reverse phase liquid chromatography (RP-HPLC) using a CI 8 column (7.8 x 

10 300 niM; Waters, Milford, MA) in a Biocad HPTC system (Pcrscptivc Biosystems, 
I'ramingham, MA). I'ractions were eluted IVom the column w^ith a linear gradient from 0- 
100% buffer (0.1% ^fFA in acelonitrile). 'Ihc How rale was 10 nil/minute and eluent was 
monitored al 214 nm and 280 nm. 

Six fractions were collected, dried, suspended in PBS and tested individually 

15 in A/ ^;//?<:vx'i//fM7'v-infectcd guinea pigs for induction of delayed type hypersensitivity (Dili) 
reaction. One fraction was found to induce a strong DTH reaction and wais subsequently 
fractjonatcd further by RP-llPfC^ on a microbore Vydac CM 8 column (Cat. No. 218TP51 15) 
in a Peikin I' Imer .Xpf^ilicd Bios\'stems I)i\ision Model 172 HPLC. Fractions were eluted 
witli a linear uradiciu from :"^-10f)^^ iMitlei (f).f).'^"(i TF.A m acct(mitrilc) wwh a flow rate of SO 

20 ui minute. I liienl was mornloi'ccl al 215 inn Fu^hl frat:t;ons were collected and tCNted lor 
iiuluelu>ri Cm 1)111 in ,1/ /?//icrc;^/f j.v/,s -iiilecled guinea pigs ( )ne Ir.iction was toniul to induce 
strong 1)1 11 ot about 1(> inni induration. 1 he other tractions did not induce deteetalilc Dfli. 
The posiliNC Traction \sas submitted to SI)S-P/\(jI' gel electrophoresis and IoliirI to contain a 
single fuotein IxukI oI approxunateK 12 kl) molecular weight. 

2"^ ] h\> [^ol) [K'ptkie, herein alter ivlerred \o a> DIMM), was sequenced troni ihe 
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HXAMPLF 5 
Synthesis oI' Synthi-tic Poi^yphptides 

Polypeptides may be synthesized on a Millipore 9050 peptide synthesizer 
using FMOC^ chemistiY with MPTl I {('VBcnzotriai'.ole N,N,N',N'-Letiaineihyiuroniiim 
hcxafluorophosphatc) activation. A (Jly-C^■s-GIy sequence may he attached to the amino 
terminus of the peptide to provide a method of conjugation or labeHng of the peptide. 
Cleavage of the peptides from the sohd support may be carried out using the following 
cleavage mixture: trilluoroacctic acid;ethanedithiol:thioanisolc:watcr:phenol (40:1 :2;2:3).' 
After cleaving for 2 hours, the [K'ptides may he precipitated in cold methyl-t-butyl-ethcr. The 
peptide pellets may then be dissolved in water containing 0.1% trifluoroacetic acid (TI*A) and 
lyophilizcd prior to purification by CI 8 reverse phase HPLC. A gradient of 0-60''/o 
acetonitrile (containing 0.1% TFA) in water (containing 0.1% TFA) may be used to elutc the 
peptides. Following lytiphilization of the pure fractions, the peptides may be characterized 
using clectrospray mass speclrometn and by ammo acid analysis. 

I his procedure was used to synthesize a TbM-l peptide that contains one and 
a half repeats of a FbM- 1 sequence. The TbM-1 peptide has the sequericc 
(iC(il)KS(.(i\FI)(,)IRl i(l(I)KS(.(,\'I. (SI (,) II) NO: (y^). 



f-XANHMd' () 

1 'Si OI K [[Masi-: N rATl\'! A M K^IXS H )K St-KOf )1A(.\() S1S O! T i • tii. KClI.OSlS 



i hi:- I xaiTipk' iliustralcs tlic diaunoNlic propcidc^ n!' sr\LM;il tepi\-s-!it ii !\ 
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limes with PBS/0. 1 % Twecn 20^". 50 |.iL sera, diluted 1 :100 in PBS/0. 1 % I wccn 20^^/0.1% 
BSA, was then added to eacli well and incubated for 30 minutes at room temperature. The 
plates were then washed again five times with PBS/0. 1% Twecn 20^^. 

Tlie enzyme conjugate (horseradish peroxidase - Protein A, Zymcd, Sail 
5 Francisco, CA) was then diluted 1:10,000 m ]^DS/0.1% Twecn 20^*^/0.1% BSA, and 50 \iL of 
the diluted conjugate was added to each well and incubaicd for 30 minutes at mnni 
temperature. Following incubation, the wells were washed five tmies with PBS/().I% I ween 
20^^^. 100|.iL of tetramelhylbenzidine peroxidase (I'MB) substrate (Kirkcgaaid and Perry^ 
Laboratories, Gaithersburg, MD) was added, undiluted, and incubated for about 15 minutes. 

10 The reaction was stopped with the addition of 100 pi. of I N IFSO. to each well, and the 
plates were read at 450 nm. 

f igure 4 shows the KLISA reactivity of two recombinant antigens isolated 
using method A in f^xample 3 ( 1 bRa3 and TbRa9) with sera from M. tuberculosis positive 
and negative patients. 1hc reactivity of these antigens is compared to that of bacterial lysate 

I'l isolated from \{ tuberculosis strain H37Ra (I)ifco, Detroit, MI). In both cases, the 
recombinant antigens differentiated positive from negati\'e sera. Based on cut-off values 
obtained from receiver-operator curves, fbRa3 tletected 56 out of 87 positive sera, and 
TbRaO delected 1 1 1 out of 165 positive sera. 

fiLMirc illustrates the Fl IS.A reaetiv U; of :e[>[L-sefiIati\e antirens isdhiied * 

2n usm:' nietlKni B ol ['Xaniple .v 1 he reaciiMlv dt' the recoinbinaiil antigens 'ff^IM, ThI 1 ] 2. 
lb>K i and the peptide IFM 1 (as Jiescribeii in i'xample 4) is eiMupared lo tfiat cil'the 3S kl) 
air.i^^en deseribed by .Andersen and Hansen, hifccl Inunu}} > ":24X 1 -24KS. 1^8'). .'\gain. all 
ot the poly peptides tested differentiated positi\e from negative sera, ikised on cut-off \alues 
obtained Irom receiver-operator curves. rbn4 detected 67 out o[ 126 [positive sera, lb} II 2 
Je:ecfed ^0 of 1?^ positive sera, .'^S-1 detected 01 ou[ ot KM poMiive sera and the I b.\l I 
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to the reactivity ofM tuberculosis lysate and the 38 kD antigen. The resuhs are presented 
Table 3, below: 

T ABLE 3 

5 RHACTiVITV ()!■ ANTK^KNS WITH SBRA f-ROM M T U /U-Rd/J.OS/SVATlENT^ 





Acid 
Fast 


r-:LlSA Values 


Patient 


Sputum 


1 .ysatc 


38kD 


TbRa9 


TbH 1 2 






Tb01R93I-2 




1.853 


0.634 


0.998 


1.022 


1 .030 


1 314 


Tb01B93M9 




2.657 


2.322 


0.608 


0.837 


1.857 


2.335 


TbOin93I-8 




2.703 


0.527 


0.492 


0.281 


0.501 


2.002 


Tb01B93M0 




1.665 


1.301 


0.685 


0.216 


0.448 


0.458 


rb()lB93I-]l 




2.817 


0.697 


0.509 


0.301 


0.173 


2.608 


Tb()lR93M5 




1.28 


0.283 


0.808 


0.218 


1.537 


0.81 I 


Tb()lB93I-16 


.f 


2.908 




0.899 


0.441 


0.593 


1.080 


Tb()lB93I-25 


■f- t 


0.3^)5 


0.131 


0.135 


0.21 1 


0 107 


0 948 


lb(llB<).>N.S;^ 


j 








2.2S2 


0.^)77 




0.857 






1/^12 




2.43() 


0.870 \ 


0.520 


0.952 


rh{)l}^t)41-l()H 


f — 1 


1,()3^) 


0.^.41 1 


0.707 


0.368 ! 


0.654 


0.798 


Th()lB^J4I-2()l 


1- ^ 4 


1.721 


0.419 


0.601 


0.137 


0.004 


0.692 


rhf)]l^^).^^l-88 


1 


1 




2.51^' 


i.";<s] \ 


0.2 14 


o..^3o 


T I. I ll i)( 1"; T * ) ^ 
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Acid 
Fast 


EIJSA Values 


Patient 


Sputum 


Lysatc 


JoKlJ 


! OKaV 


i uli 1 1 


1 Dhl4 


TbRa3 


Tb01B931-9 




n A /If) 


U.Z/6 


U.Z 1 U 


A 1 /f A 

0. 140 


A 1 0 1 

0. 1 81 


1.586 


Tb()lB93I-14 






1 ^ 1 1* 


U.Zoz 


A OA 1 

U.2VI 


0.549 


2.880 


Tb01B*^3!-21 




1^ * i~ 


IJ.739 




\ 

A TO T 

0. /83 


0.536 


1.770 


TbOl 893 1-22 




n T t '1 


I). 4 5 1 


2.082 


0.285 


0.269 


1.159 


Tb01B<)3I-31 


- ' ■ — 




t}.49u 


1.019 


0.812 


0.176 


1.293 


lb()lB93I-32 




J.zo 1 


f\ TO/' 

0. /80 


0.668 


0.27.^ 


0.535 


0.405 


7b()lB93I-52 


- 


U . U J) n 


U.J 


C\ A1A 
[f.H.iH 




W.J. 1 S 


1 . 1 40 


i b01B93I-99 


- 


z. 1 1 r) 




1 .0^ 


A 1 1 A 


0.0 / / 
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Tb01B94M30 




1.349 


0.224 


0.86 


0.282 


0.383 




rbOIB94I-I31 




0.685 


0.324 


1 . 1 73 


0.059 


0.118 


1 .43 1 


AT4-()070 


Ncimiai 


0.072 


0.043 


0.092 


0.071 


0.040 


0.039 


AT4-()105 


Nomtal 


0.397 


0.12] 


0.] 18 


0.103 


0.078 


0.390 


3/15/94-1 


Ncimiai 


0.2:7 

! 


f).()64 ! 

1 


0.098 


0.026 


0.001 


().22S 


4 1> 93-2 


Normal j 

j 


0.114 \ 




0,071 j 


O.Ovl j 


D.O'll 
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5:<)94-4 


\orniai 




0.259 [ 
1 


0.09(> 


0.046 


n.oo<s 


0.053 


5 26 94-3 


N\>rnial 


0.139 j 


()()9'^> ! 

1 


o.os.^ 


0.019 


0067 


0 01 



Based AM cin-ott (^hiaiiirJ trmn iwrivri-opnatdi ctirvcv ThKa 

detected 2'^ (nil tW" 7'^ pos!ti\ e ^er;i, ThK.;') d -f:^■^^ ' ^ ^ ^ - '^ < ' • ■ ' ' ' 
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The reactivity of the recombinant antigen IbRall with sera from 
M tuberculosis patients shown to be negative lor the 38 kD antigen, as well as with sera irom 
PPD positive and normal donors, was determined by CLISA as described above. The results 
are shown in Figure 6 which indicates that ThRal 1, while being negative with sera from PPD 
5 positive and normal donors, detected sera that were negative with the 38 kD antigen. Of the 
thirteen 38 kD negative sera tested, nine were positive with TbRali, indicating th.nt this 
antigen may be reacting with a sub-group of 38 kD antigen negative sera. In contrast, in a 
group of 38 kD positive sera where TbRal 1 was reactive, the mean CD 450 for TbRal 1 was 
lower than that for the 38 kD antigen. The data indicate an inverse relationship between the 
10 presence of TbRal 1 activity and 38 kD positivity. 

The antigen 1 bRa2A was tested in an indirect ELISA using initially 50 /.il ol" 
serum al 1:100 dilution for 30 minutes at room temperature followed by washing in PBS 
Twecn and incubating for 30 minutes with biotinylated Protein A (Zymed, San Francisco, 
CA) at a IT 0,000 dilution. Following washing, 50 of streptavidin-horseradish peroxidase 
15 (Zymed) al IT 0,000 dilution was added and the mixture incubated for 30 minutes. After 
washing, the assay was developed with TMB substrate as described above. The reactivity of 
fhRa2A with sera from A/ luhcrculosis patients and normal donors in shown in 'fable 4. The 
mean value \ov reactiviiy of 'fbRa2A with sera trom A/ luhcrculosis patients was 0.444 with 
:i staixlard dev iation of 0.300 [ he mcdu tor i\\uMi\ii\ \^ith sera Irom normal donors was 
20 0.100 with a standard dc\ianon of 0.020. Tcstiiit: ot >S kl) nceaUxe sera (f iuure?) also 
milica.ti'ii [\u\ I hjv.a.W anliucii was capatMe of dclcctitiLi sera ui this calegorw 

1 ABLf 4 

Ri Acmjj > f )i l uR \ WI TH \ rK()\: \f^__[2JUJ^' /' ^^'Z^' Pa i M N [ s A M) } R( )M Norma; 

l)()N(lRS 
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1 591 


1 B 
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Tb92 


I'D 
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TB 
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TB 


0.333 


Tb95 


TB 


0.435 


Tb96 


TB 


0.284 


Tb97 


TB 


0.320 


Tb99 


TB 


0.328 


Tbiuu 


IB 


0.817 


^rbioi 


TB 


0.607 


lbl02 


IB 


O.HM 
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TB 


0.228 


Tbl()7 


TB 


0.324 


Tbl09 


TB 


1,572 


Tbll2 


TB 


0.338 


DI .4-0] 7() 


Normal 


{).03o 


A'r4-004.1 


Normal 


0.1 2t> 


A 14-0044 


Normal 


0.130 


A14-0052 


Normal 


OT35 


A 14-0053 


Normal 


0.133 


A'r4-0062 


Normal 


0.128 


Ar4-0070 
"a1 '4-00^)1 


Normal 


O.0S8 


Normal 


0.108 


A 14-0 100 


NvKma! 


0.106 


A 14-0 1 05 


Normal 


0.108 


A'r4-0KK) 


Normal 


0 lOS 



! lu- itMctix it\' ni \\]c I oc( Mil hi n;)m iiiitii'L'ii ( sm ( SI'(} f I ) \(): (>()) with sera troni 
A/ luhi'fct^loMs palicnls aiul iioioki! Jtinors \saN dclnfiiiiUHi b\ M !S \ as dc'snihcd abovt- 
I'luurc 8 sh(>^^'s the results of tlic titratidii tiraiiliLicn iii) with ioui .\f fithcrculoMs posituc 
5 sera that were all teacli\e with the ><S kP antigen arul with lour donor sera. -Ml four positi\'e 
sera were reaetiw with anti_i:en (l'). 

Ihe reacliMtN ol tiie leeoinbinain antieeti I bll-!^^^ (Sh(,) ID \(): wwh 
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donors and with a pool of sera from M. iuhcrculosis patients. The mean OD 450 was 
demonstrated to be higher with sera from A/, tuberculosis patients than from normal donors, 
with the mean OD 450 being significantly higher in the indirect ELISA than in the direct 
ELISA. Figure 11 is a titration curve for the reactivity ol' recombinant TbH-33 with sera 
5 from A/, tuberculosis paticfits and from nonnal donors showing an increase in OD 450 with 
increasing concentralion of antigen. 

The reactivity t)f the recombinant antigens RDIF6, RDIFS and RDIFIO (SEQ 
ID NOS: 1 84-187, respectively) with sera from M. tuberculosis patients and nonnal donors 
was dctcnnincd by HLISA as described above. FU3IF6 detected 6 out of 32 M. tuberculosis 
10 sera and 0 out of 1 5 normal sera; RDIF8 detected 14 out of 32 M. tuberculosis sera and 0 out 
of 15 normal sera; and RDIIHO detected 4 out of 27 M tuberculosis sera and 1 out of 15 
normal sera. In addition, RDIFIO was found to detect 0 out of 5 sera from PPD-positive 
donors. 

15 fiXAMPLh: 7 

[^r[-:paration ano ('haracthkization of M. Tt^Hfrncmosis Fusion Prothins 

A fusion protein ctuitainiiig rbRa3, the 38 kl) antigen and rh38-l was 
prepared as iollows, 

20 luicli v^t'lhc DNA consiriict.s TbRa\ .>S kl) aiui 1 h.^N-! wore nioditicd h\ P("R 

in order to lacilitate ihcu" lusion and tlvj Mibscqiienl expression of the fusion prolem ThRa"^- 
kD'Th.^S- i. 1 bRa.i, 3S kl) aiul I b'^S 1 DNA was used U> perlorni PC R using the primers 
in)M-64 and Pl)M-6> (Sk.U H) NO; 141 and 142), PI)M-57 anil Pi)M-58 (Sh;Q ID NO: 143 
and 144), and Pl)M-f>9 and PI)M-bf) (SI O II) NO; M^-14()), respective!) . In each case, the 

2.-^ DNA ainpMfu-atioii was pcrtorined usin- ]() M)X IMu huftei-. 2 ul K) niM dNTPs. : ^1 each 



l.ih.t .in. j' ^ < 
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68°C for 15 sec and 72°C for 3 min, imd finally by 72T for 4 min. For Tb38-I dcnaturation 
at 94°C for 2 min was followed by 10 cycles of 96X^ for 15 sec, 68°C for 15 sec and 72°C for 
K5 min, 30 cycles of 96T for 15 sec, 64°C for 15 sec and 72°C for 1.5, and finally by 72"C 
for 4 min. 

'fhc Tbka3 PGR fragment was digested with Ndcl and fxoRI and cloned 
directly into pr7'i.2 If 1 vector using Ndel and fxoRI sites. The 38 kD VCR fn lament ii'^r; 
digested with Sse8387I, treated with 1'4 DNA polymerase to make blunt ends and then 
digested with HcoRI for direct cloning into the pl7'"^L2Ra3-l vector which was digested with 
Stul and HcoRf The 38-1 PGR fragment was digested with I{co47IIl and HcoRI and directly 
subcloned into pT7^L2Ra3/38kD-l 7 digested wath the same enzymes. The whole fusion was 
then transferred to pr:T28b using Ndel and licoRl sites. The fusion construct was confirmed 
by r3NA sequencing. 

The expression construct was transformed to 13LR pLys S E. coli (Novagen, 
Madison, WI) and grown overnight in LB broth with kanamycin (30 |.ig/ml) and 
chltiramphenicol (34 ^ig/ml). ^Ilus culture (12 ml) was used to inoculate 500 nil 2XYT with 
the same antibiotics and the culture was induced with IPTCi at an 01)560 of 0.44 to a fnial 
concentration of 1.2 mM. four hours post-inductinn, the bacteria were harvested and 
sonicated in 20 mM Iris (8.0), 100 mM NaGl, [)()(\ 20 ^gAnl feupeptin. 20 mM. 

}*MS1' tolloued h\ ccniriiuL'ation at 26,000 X l-. 1 he rcsulling pelict vsas resuspendcd in K M 
urva. 21) niM Iris (S.O), 100 niM NaGl and bound to Pro-bond nickel resin (Invitrogen. 
Garlshad, ( A). [ he e^'iiirnii \va:> ua;.]icd sc\era! tnncN with the nbow (nifter then eluted wuh 
an unida/olc gradient (50 mM, lOf) mM, >()() niM imida/ole was added to S M urea, 20 mM 
Ins (8.0), ]0() mM Nad). I he eluatcs conlamuig (he protein of interest were then dial/ved 
against 1 0 mM Tris (SO) 

i he DNA aiui amino aeul soiniencL's for the iL'sultnij' jnsiofi judtoin 



wo 98/16645 



53 



PCT/US97/18214 



procedure to that described above. Tlic DNA sequence for the rh}19- rb38-l fusion protein is 
provided in Sl^Q ID NO: 151. 

A fusion protein containing I hRal, the antigen 38kD, rb38-l and DPEP was 
prepared as follows. 

Each of the DNA constructs TbRa3, 38 kD and Tb38-1 were modified by PCR 
and cloned into vectors cssentiaiiy as described above, with the primers PDM-69 (SEQ ID 
NO: 145 and PDM-83 fSEQ ID NO: 200) being used for amplification of the Tb38-1A 
fragment. Tb38-1 A differs from rb38-l by a Dral site at the 3' end of the coding region that 
keeps the final amino acid intact while creating a blunt restriction site that is in frame The 
TbRa3/38kD/^rb38- 1 A fusion was then transferred to pin78b using Ndel and EeoRl sites. 

DPEP DNA was used to perform PCR using the primers l^DM-84 and PDM- 
85 (SEQ ID NO: 201 and 202, respectively) and 1 f^il DNA at 50 ng/^l. Denaturation at 94 
w^as performed for 2 min, followed by 10 cycles of 96 "C for 15 sec, 68 Xl for 15 sec and 72 
T for 1.5 min; 30 cycles of 96 T for 15 sec, 64 T for 15 sec and 72 "C for 1.5 min; and 
finally by 72 T for 4 min. The DPIvP PCR fragment was digested with IxoRl and Eco72I 
and clones directly into the pfyr28Ra3/38kD/38-l A construct which was digested with Dral 
and EcoRl. Ihe fusion construct was confirmed to be correct by DNA sequencing. 
Recombinant protein was prepared as described abo\'e, The DNA and amino acid sequences 
for the resulting fusion protein ( hereinafler referred lo as TbI'-2) are provided in Shi) ID NO: 
203 and 204, respectively. 

I'XAMPIi:8 

I'^^f f ^' U i rni R<'( fn s/s\'\^SH )\ PKO t IINS ! OR 

S [ ■ i< ( ) ; M x t , \ ( )sis ni li m n ( > s i s 

[ iie ctVccii\ oncss (if the fusioii pnUcm rbRa.''^->S kO^ I h^^fs- 1 . picpared as 
dcscnhcJ abo\ c. ni t^.L' scrfHliai'iutsis of uibetvuiosis mtecluni was examined b\ IT, ISA. 

1 lie i : IS \ proioeo! wa- aeseriheJ aboxe n; 1 xaniple d, wiih :he fusion 
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the three antigens individually or in combination. Such a panel enabled the dissection of the 
serological reactivity of" the fusion protein to determine if all tiirce epitopes functioned with 
the fusion protein. As shown in Table 5, all four sera that reacted with TbRa3 only were 
detectable with the fiision protein. Three sera that reacted only with Tb38-1 were also 
5 delectable, as were two sear that reacted with 38 kD alone. The remaining 1 5 sera were ail 
positive with the fusion protein based on a cut-off in the assay of mean negatives +3 standard 
deviations. This data demonstrates the functional activity of all three epitopes in the fusion 
protein. 

10 I MILE 5 

RHA(^riv[i Y OF Tri-PhptioI' I-usic)\ Protean w[7 i [ Si:ra from A/ riii^KRciiLosis PA riENTs 
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289004 
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The rc;iLli\'it\' of the fusion pi-otcin I'hl -f* w ith .sera lioni M liihcycuh)sis- 
iiilecleJ patients was exaniitied h\ I I. IS \ u^aww the p:\)loei>l Jeserihed abtue I he re,-.uhs o\ 
these Ntiuiie^ i I .if^h- ,K-n.:>M.,'- 'i^ .i , ,, , , ... ; ; . .i .■ , 
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Tabli: 6 

Ri^ACTivrrv Of TbF-2 Fusion Prothin vvi rn TB and Normal SliRa 
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One ot'skiil in the an wiil appreciate that the order of the individual antigens 
within the fusion protein may be changed and that comparable activity would be expected 
provided each of the epitopes is still functionally available. In addition, truncated forms of 
the proteins containing active epitopes may be used in the construction of fusion proteins. 

I-rom the foregoing, it will be appreciated that, although specific embodiments 
oJ' the invention have liecn described herein for tlie purpose of illustration, various 
modifications may he made without deviating from the spirit and scope of the invention. 
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(1) GtNKRAL IN[-r)KMATION; 

{!) APPLICANTS: Reed, Steven G. 

Skeiky, Yasir A.W. 
Dill on , Da vin C . 
C:nmpo.s -Not n. Ant on] a 
fiouqht on, [<<i yinorid 
Vo(ivick, Thomas . 
Tward:: J- }; , Daniel H. 
:aj(Jos, Michaoi J. 

(lij TITI.r (jF INVKNTJON: t a )MPC;DN ; AND MKIflOrX) VOl{ L'lAClNOSIS OP' 

uv^ roRKi::-;L-'oronF::;rr: addhk:o;: 

(A) AI):)RK^;SKK : SKED and HElRKV LL,P 

(H) jrrREFT: b300 ('olamo:a r.ait.or, "'01 ViiX.h Avonao 

{ I ) ) S T A? K : Wa s m n q t. on 
(K) XiCNTHY: USA 

Z"i;o ^un ;m -vo^);' 

;v) COMPUTER KKADAiBPK Pf)KM : 

{A; MPDIUM TYPrA l-'loprv ciial: 
(in A()MPUTr;K: IBM P^' cornrMt. iro^' 

Aj. :>( .)I-"rWA;A', : Pa^nririn R-;..*:'- « 1 A ' , (i ! . A^ 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

CGAGGCACCG GTAGTTTGAA CCAAACGCAC AATCGACGGG CAAACGAACG GAAGAACACA 6 0 

ACCATGAA'^^A TGGTG/iAATG C ATCGC'^GGA GCTCTGACCC; C ::C;C:GGCTGC AATCC^GGGCC 120 

gctggggG'::g gtgtgacttg riATCATi'^crr cc;(:gc;ccc(;(; TccrrATAGCA gatgcagccg ]so 

GTCG'VCTTCG GCGCGCChCT GCCGTTl^GA' ' CCC^GC/KTCCG CG ':r:Tr;AGG'r r:rrGACGGCC 2 4 0 

GGCCAGTTGA CCh(j, [a./ [■(,,{:■[ GAAGAGGCV(^ GC^'(]A'ycrCf\ ACG'IxrPGCn'T Tf ■CC;AACAAG 3(jfi 

GiGCAGTrTGG TC ::;AGGGCc; S :^AT(:GGf;GGG AC:"GAGGG(;G GCA'rCGG(X.;A GGACAAGCTG iGii 

aagaa:^ :;crx; ccg^a:.cagg'.; G;;AT'C'rGcr'(; crGTCcvrycA ^vG'vn^ACGAA cfVYccAGccG a^o 

GCGGCCGCCG GTTC'"^GCCAG ("'G :":GGAC:GTT TCGGTGTGGG G'l-:CGfu\GGG GTCGTGG ;:G:'^ 4^10 

GTCACGCAGA ACG7CACGTT CGTGAATCAA GC;r'GGGTGGA TGC'IGTCACG GGCATCG^.CG ^10 

ATGGA;:;'rTGG TGGAGGCGr:c A';!'-GNAArTi^^ AT7'GG^'G(^GG r'GGNTTGACiG GGCX^TGTTGA " bOO 

GCTAGGGGC;C CC^GGGTGG'l'C^ A'.'G(;GTGCAT GTGGA/\(^ACT TGGr;GGTGTA GCAGr;GTGc:G b60 

GGiVrGGGrj^^f] ( 'G'^CGGhCGG. ACr(]rrc:^(:■\■ '•/y^A::rGG'r( ^ •^'yGGAGA'l'/.G G'GMV[GK(''\'C G';' 

''■^■■■A' 'GA' :fiG ArG7G^("^"'':(":; :::jt:^'-N(" -GJ'^' """I'-Gfj:- ;gt ggagga -'.g. 

'G: ■ : \G')Vyj\: :G[i V:>G [G\G ! W^:. : 

! A; \G.G\G W : ' S,: y--\ : r 
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77TCTCGACG AC f^TGACC^Vr ( ^AGCCC^TCGC: GATGC'I'GAAT TCCC:G'r'rGGA AAACAA"GAA 30 i; 

TTGAATGTCG 'rG:^ATC;TGGG GAGTGTCA;^.: GGCACCTACC: TGAAGGGGGA GGCCC^TGGAT 3bG 

TCGGCGGTGC TGGCGAACGG CGACGAGGTG CAGATCGGCA AGCTCCGGTT GGTGTTCTTG ^20 

ACCGGACCCA AGCAAGGCGA GGATGACGGG ACTACCGGGG GCCCGTGAGC GCACCCGATA 4 80 

GCCCCGCGCT GGCGGGi';ATG TGGATCGGGG GGGTCCTGCG ACCTGC^TAC:G AGCGGATTTT b^O 

CCCTGATGTC rACCATi":TCC AAGATT^:GA'^ TCTTGCiGAGG CTTGAf.^GGTG N:;GGTGACGC bOO 

c(:('cG'":gggc g'yc^ttcsgg (xtmtc-.::ggn ';GTTTf:A(.:(:G gntagcnagt gi::c:ncx:7cgn bhO 

TTGGNAATTC N'i'rCT'i^G:: '"T GG':'''"r:N/VAAG GGACC:;TTAN r'TTGCTGCTN G/i/^ANGGTNA 72t^ 

TCCNGGc;cc:c ntgct:^gaan GCCGf^TGcc:r cr 7^2 

1 ) l^KQULNCK CHAlvACTKKIGTIiCS : 

( a; LI^NGGH : 8n tvis^^ p.i i r n 
(B) TYrr. : Jiuclcjir acid 
(Gi GTF-'/.NDKLfNEGG : 5:nqie 
{ [.)) TOPGLOGY : 1 i near 



f\[-r^y- ;GA i'^ ■ A^ 'CV^T' 7.-. 

Ar;-7:'-GA7G g:^:":'A7G': 



'-[^.l 1 = 7 



':''-gtG';7'":a; 



in;) 
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GAGCAAC(;:v\ (iACxx;c;:';A(:A AC'W;'^r;'rATC">-; fxTPv^ccGcrt: /\r\'i(;(:r,GGr/:- 'rnr^AAcrcNr; vac 

TGAAATTATC ACAACTTCGC AGTCACNAAA NAA 013 
(2) INfUKMATlON FOR S1i:Q ID NO : 4 : 

(i) SEQUENCt. CHAFIACTERISTICS: 

(A) MvNC^Tilr bd^^' pai:\s 

(B) TY['L: nucltMc ac::d 
((.") ::':'^ANfJF:[^NESi: ; :;.iMqIe 



rv:(:'rA'r:;>iA:' AC(;(.A\Ai::c,; < 'ogai'aagvt ca :AG''7G?r''/ gaggt; ■]■:■;:;'-." agc;(;attgg(' 

CAl^CGTTGAT ATCGGi;.; :GTA GCGGCTT^:- -'F C^^GCTTGGGT GTTGTGGACA hCIXACGGCJvA 180 

cG'';ggca';'ga G:^crAA:':;GG ':GG'r:G;GGA': (■ :\:ggggg^ '-^ g^^aa. 'ggtcc; ggatctcgac ;>';o 

rc;i:;GGACGTG atga::gggc;g tcc;A':.:gc;ggg rGCGATGAAc; tggg(A'Aggg gc^atggggc^a 300 



wo 98/16645 PrT/lJS97/182l4 

6? 

ccxjf pCGA'::-:^c; NGAC^rGCCf ;:; AA'y(]:]C-y:iiA v;t' iagga^gt c^i ;rjrA:;':':';\r Gi'GCAi.iNcrt; v/jo 

ATCCiAATCAA (T'TGMATTC^'; f;NCTV^Ni";GC^N CCATTTCJACA A'rc:(;AGGTAG 7GAGCGCAAA 300 

TGAATGATGG AAAACGGGNG GNGAGGTCCG NTGTTCTGGT GGTGNTAGGT GNGTGNCTGG 360 

NGTNGNGGNT ATCAGGATGT TCTTGGNCGA AANCTGATGN CGAGGAACAG GGTGTNCCCG ^20 

NNANNGGtJAN GGNGTr(^NAtJ CCCNNNNTG^^ TGCUJGGANAT CANANAGNGG NTTGATGNGA 4 80 

rJA/vAACU^GTG GANCAGrJNNM AAN'lT.'CJrJGr^N GCflAANAAN^^ NNN ANN JNG NNACrJTNGNT bAO 

::iUi'rW:'Vmi(.: ANNNNNriN'l'c; NNGNNGNrJ;-N rJrJNGAANGNN rJTWNNrJGNAA rJNGGMTNTT G0() 

KA.AT CA')d 
'.2] I NKC'I^MATION [■(:[< bF.Q ] V> fJO : : 

i;) i:y.Q[]K[-ii'y. ' ■ilAKAGTL:!',! GT K/G : 

(A) NErJ'G';;: t:^ ' G^s^ ]\>: r 

(B) TYPEG nut:lui^- acid 

{ G ) frr R A t4 n K D N E 5 :-■> : ; ; 1 r 1 q 1 ' 
{[)) TOPOLOCJV: linear 



(x:; GKCnH'N^^i: GK;G:r: PG ! ON : :G:V' ![■■ IG j : h : 
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(A) LKNGTII; b.ii;^' f-a : r 

( ] TYPI': : i;wc: 1 e 1 ^ ■ ac J ci 

( C ) f: T UAUD V, I ) ^J ?: ; S : - i n r } .1 c 

(0) 'JX'POLOGY: linear 



(XI ) iiLQURNCr n'I::SCnurTJON: Si:0 lU N0:7: 

CGACCACGAr GG:'GCCGCAG AGGG(;(r^GGG AACv;ggGATC GAGGCGC;Crf"^ TGGCCAGAGT GO 

^■"7;e^GAGCAi^c^ gagc;agc;:;a:; tg'.";aa':'':'A1'G aaagttgtga AGt.:ATA7"Tc:A GGiprGT:Gi:G 120 

crrcccc(^A(] (^G'';G:^(;rGGG"^ rrrr^v:^- 'GA^: G":^^tatc;ggg agggG' ^gcg^-; "\';AG'rTGGGG InO 

gg:g^gg::ggc; ag-a:;^ ::7GG'^ GAT'.;r'GGTG'" ggggacgagc; rjArrr^GTGAr' gc;ccgggtgc^ 2^0 

gg:;A' x;tt jc; :.G(^A3AGAGr G:-7G[r:GG(.;G gaggtgcggg c;Tc;GGGr;rAA r;r;AAG2;^G'rG 300 

i^C-:T]CCGCCi] VCA^O'^GCChG GGTGGGGTCiG GPGTGGTC^GG TCA;AG(;CAGA ^:ACGAGGATri 360 

GT':7rAGGGG(^ ':a(a;ggaa.ag cohCfKCGC.cG c;ggc;cga7gt t(;ggg::;cx:ag agc^aggtggg 420 

■"^G';gc;7Gagg (::c;aag^';(:(G.g: GiVt'rciTGGr'G T:G:;GGGGGAr; GAAG:"'GGGAr AGi^Gr;rnr;r;A 4Hn 
':g-':CCGC(:ac ccvt'ycggcc^': ggatgtggg*' gG'''i.;aatagg ^:cg;ggacgt;:/ g(^g(^gaat? 5^0 

■''AGTG(:^A^r'":(^ ::'AGGGG'''Gr;T '-TG^TGGTf ■ ■:"Gr;G AGC-l/i/^A -.'■r^T TGr^rG- ^" GGGGGCiGG' ^'^ 6G'i 

-•Y^T^^GGrAA GA^-^-Grr^\;A G . 'G: ^^TA- ;^ ' A A^'^^;\Gr- ^ ;r : ^ - '/w -r VG GTG ^H) 
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(■iGACCGGACG GTCAcc:;! ;gc; :;TCAGC(:Ti ;g g^'gcc 'gaagg aa i:^g: 

(2) INPTiRMATlON FOi< SKQ l [) N(.) : H : 

(i) SEQUENCE GHARACTERISTICG : 

(A) LENGTH: lASB base pairs 

(B) TYPE: nucleic acid 

(C) STPJVNDEDNKSS : single 

(D) TOPOLOGY: linoaj 



(xi) .ShiQtJKNCK f)KSCf- T PTION: S FX) If) NDrB: 

gcgag:^A'::;g cgatatggc:; : c^gagcgtag ggaaagcgg" cGrT^GAG(^:-A gtggggcgcc, go. 

c;?Ai'c:^:.:"r :c: GcirrGAGviA-; altgac^ga^:': gc\;tc;(;a(;gg iGa.;l^gtc;g;.,: . ^GiAGf:c:(.iiGr(" ij_r\ 

G-;c;ATGAGGT r;GGG-G;T';TT TAGAT'^^ATGT A:^--:GGGAC^fX; (;GGGGGCGAG CTGGG3AGGG 180 

gtaagg-o:tt ggtggggl^tg ggggagc;agt taaa(^gtc";ag otk^ggggG'^ gtgacggtac ;mo 

':'GGGGi3AGGG GTATJT:;GTG ^Ar^;;ACGAC^■■; AG: ;GC^::GCrG' GGGCGAGTG-, AGGGG'.:GAGG 300 

Ti'lATGGAGCG ATGGGGGGGG TGTGT G;;GGG G^r^GGGACG^A GCAGTATGAG ''G^G(:;f.;';:TCGT 3 f ) 0 

Gi^AGf^GGGTG GGGG :_;AG(^r G^ TT: GGG^A'^G^ ' GArTAGG(7.A GGGGGAATT^ ^TGiG^.^AATT 4/0 
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6S 

'::gc;c:c;::g(:::;c cACCx:Cf..:AAi ; atggc;c;c7G^ ; >.;/i';-;'i':'ATi Ty'r^'iGrr^'iAA . "yGCTTCiCCc^ 12 GO 

CACTGGGTAT TCGGTACGAG AGTGAAGA^\G CCGTGGGGTT AGCCACCCGG CTCATGCGTC 132 0 

GCATACAGCA GGCGGCGCAC ACCGCATCGC GGAGGCTGGC GG7VAGAGCGG GGCGCATTCC 1380 

CGGCGTTGAC CGATAGCCGG Ti'CGCGGGG7' CGGGCCCGAG GC\GCAACGC:A GAGGTGACCT lAAO 

CGGTCGCTGC GACGGC^CA 1 4 8 
-.2] IMFORMATION FOR SKO I [J NO: 9; 

■11 sr:i.)ui:NCt: GHAi<A';rKH.i :-;t k-g : 

(A) l,F,r]GTH: S62 brise p.i i r s 

[IVi TYPF: riucG n a.- 

("} ^VrFlANDFDNEGf; : :n:iqh' ■ , 

{':>) TGPOFGGV: 1 incur 



AGGGTG':7JiT ■ ^GTG'^TGG AT ^'^Ti^GAAG' 
r.G':G;:^AGGG(; :-:r:TGGCC'-':'G G-^GATGG^- 

i > I Av 'AG GAT' 'G ■ I 1' ' j' '{ " GG' 



■:Q I 1; NG: : 

G']-';f :'G^ GG-T a; y:tacv'gag atg^vg^iggg go 

7G0TGGTAGT (GG^GATGGGG :.;'1'GC;GGATCG 120 

GGAAAGGGiGT T AGGC ;GG(.;7V. : /\A(";Gf.'t;GGrT ]8() 

■ ''^'/^A'-' "A^ ■ 7\':-G(v:(;.:-f • ( iGGGAAAG'^' ■ -] 0 



1 ■■■•/^G,,' ■': ' 
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) I N}'X)HMATiu:-J yoK y^Kv NO: : 

(i) :^E^OLIi-;NCK CHARACTER} 37 J CS: 

(A) LF:NGri[: h22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDKUNESS : single 
(U) TOPOLOGY: linear 



(x.i) :.;h:oaKrJ/K rkcc]-a vrjc:: \ :-c:c iv.^ no:1(;: 

TT3ATCAGCA C:(;GGC'AAGC^C CTCACATCro TCCXn'GGGTC; TCCf\CC'rc/^<' GAATCACAAA 60 

r;A:Ac:G c;-:'g^'gaagat ccrccAAc: CTy,ccc(;[C[c c'[ccj<:.(.:<:cc GAAC;:;G'rGGA ]yo 

Gl'^'i'^GGAA )U {•■ '(VyC'i]TTC [ V;' " ' "A A' ^ T " GA' *GAG^/G- ' GG ATv"" AA' 'AG 'G^'G/GAGG Gv i \ 'C' 

TTGGTTGGGG (G'GTGGGG^l :' GAAAGGGG^ ; <";GGGGGAGGG GCG^^GGT/v/v'"' GTTGGAGGAT /4i) 

GGGTG:CG(G;G GTAG^'Gr;(Y^G ACGrGGAAcG]"^ ACGGG'GGGGA AGC^GGC^AGGA GTGATGAAGC; :U) 0 

TC:GGGGiG:G 7\ Gi rvrTGA/\AG G'[CAC;ATATA (Gg;tc^gga{g; GAGGGAAGAG GGGGGGGAGT 360 

TGGTGGTGGG ' '.r CCCrAC'YC G'GG: :T(A:7'-G GTGAr^GAT'';G ■ 7^GGGCGG"V- " GG^"GA7f:;AAG .1:g^ 



1 
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GCXTACGTGC C;ATCGTC;CCC^ G^G-TAGAGf' TTGC;A-"TA':^A AGGGGAA"GG GTCCGGTGCC^ ?40 

GGGGTGACCC AGTTTCTCAA GAAGGAAACC GATTTGGCCG GGTCGGATGT CCG(;?TGAAT 3 0U 

CCGTCGACCC GTCAACCTGA CCGGTCGGCG GAGGGGTGCG GTTCCCCGGC ATGGGACCTG 3 60 

CGGACGGTGT TCGGCCCGAT GGGGATGAGC TACAATATCA AGGGGGTGAG CAGGCTGAAT 4 20 

cttgagggac cgagtagcxc caa(^atttt(^ aacggc:a':v:a tcagcgtg7'G gaatgatgca .iro 

?AGATGf:A/vG :r:GTGAAG7c ■:;gc;(:ac:gl;ag Gi'(ii.'G..;;c.A.ViA cagcc;at':ag ^'GitaT'^'ttg 'jno 

Cr^GAGCGACA A':;TGG( iGT AG GT(:GGAC;/\AG TTCGAGAAAT AGGTGGAGC^G TGTATGGAAG fjCJO 

GGGG''.^(;TGG3 ::^GAAAGGrCJG GAC;^"GAAAGG TTGAGGC;Gr:G GC(VjrG:—C7 LGGGGGCAGG GbO 

GGG^L^(:AACG GAACGTGC:.;:^' GCTAGTGi:A(; AC.:r.;AG!.j;Ai. 'G G^TGC;A7:'AG GTAGAAGGAS 7:'() 

7C;G'rGGTTTG i.:gGTGGG']7v/v CiGAGTTGAAC: ATe;GGCGA(iA T':ATC:AG(n'': " GCK:GCiG?GGG 7 f i 0 

GATGGAGTGG GGATGACCAG GGAGTCGGT^' GGT;^GA^.V^ TGGGGGGGGG GAAGATGAT:; B ^ (; 
GfiAGAAGGGA A^:GAGGT^'G■^ A'l TGGAGAGG TGG'iv"^GTT( AGAc;AC:CT AG GG Ac; GG^'GG 

Tr/T'TACrrC];^ TGGTGG'i'GmG . ,AC; ' GATGAC; ATv'GGGTGf-' C:GAAATA( C^G GC^A'rGGGAGG 9f;0 

AGGGGTAG7'r; GGGTAAGGC5G GTTTATr'GAA GGGG^X:ATTG GTGGAG(G7V. A; ./\yvf.;GGGTG ICGJG 
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ggtt:^ctcca /M:w:cr>T'W](T (\('r(^j\rG(^cc (v:r\Tr::r\ci\{: cacgttcaa^: ::a(;acgatc:g ?ao 

CCGCGACCGG CCGGCTCTCC TCGACCGAAC CCAAGGTGCA GAACATCC:(:G ATCaJGGACCG 300 

hCr;o:;GGCCG GCGGATCCGG GACGCGTTCG TGGTCGGGGA CGGTTACGCC GAGTTGATGA J60 

CGGCCGACTA CAGCCAGATG GAGATGCG^A TGATGGGGCA CCTGTCCGGG GAGGAGGGCC 4 20 

TCAT:GAG3G oTTG-AAGAGC GG:,GAGGAC(^ 'itrrAT'l'GGTT CGTGGCG-TGG CGGGTG'iTGG 480 

gt;t;cc:at :^(';A'::GAGGrc acgggcciagt TGC(;(;c\^ci\, c;L;?cAAGC;Gf.; atgtcctacg b40 

GGGT^GTrrA ■:V^G'";TTGAG^ ~ GGGTAGGGGG TGTCXX'Ar-; -a Gr7GAA.AATG TCCACCGAG:, 

AA'3r.-:AA :::;a :a:a:;atgga'.^ Gr{;TA'"""iTr; -^vvv^at--. ; ccggctgg:;-' gactacctg:: 6 60 

:;ggcj:gtagt "^■;A'.:c:c;c;(;('g :::;(:a;u;;.;a^:g gg'ivvc/.cctg r;A(.:Gr;Tr;^""rG Gri^v Yryr^GC-:' 'r:^.o 

GGVAi TT'";' i."GA'-'"'TGG A' ^ AGGAGG/Jv'^G" f.iGC'/vAGTOi' *G GG ACiT'^GC "GG' ' GAG ' 'CiGGCG? .i V.jj 

':'G("'tgaA'G;;g ccccatccfv:, gggaggl;gcsg gggagatg/iG g/iAggt:;g(g.; a':'gatgga;;g h^io 

Tr:(;AGAAi;GG G(GrGAAGGAG GG/^GAGiGI'GG rcrrGi (]C/\T r^GTGGTi";GA(-: G'l'GG AGGAi:i-; OGO 

A'-^:-TnGT-:TG GGAr* ATCG^G;* G(^GG;GCAA^: : ' 'GAv GGG' ^G ''G;AGCG^':GTG (G;G-X;GC;AGA OtG,) 

Ac^//;'(:;c;c^; a.;g G(.;c:i gacggg; ggggag^git'- > ':/:'tg(ggg;t ggggcigoaag: g/k:{;gg(:g-;ga io:g) 

GG I'GGGA. "G^" GG^ V 'A: ' T^GGSGG" 'G A : ■■"'GGCG' AG' > z^;' ir'G^; 'G^G ;GGG AT 'i' 
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':;gccgtc:gga tc;gc(;attg(^ GGA:;GT'r';\': ' ^;gigcgal\;g gt(-;gcc-G'.':"g i^A(;'"AGr;GAG g^i:^ 

ATGGAGAAG'l' GTGt^GGr/r'i'G (";c;C^:AAGG7'T ATGTCAG'I'GC; AATCTGAGGG GACGGGGGCA 30 0 

ACCTAGTTGT GCAGTTACTG TTGAAAGCGA CAGCCATGCG AGTCCACGCA TGGCCAAGTT 360 

GGCCCGAGTA GTGGGGCTAG TACAGGAAGA GCAACCTAGC GACATGACC^A ATCACCCACG A70 

GTATTGGGGA CCGGCGCAf^C AGCCGGGAAG CCCAGGTTAT G^:TCAGGG("-;G AGCAGCAAAC 4 80 

GTAGAG::CAG GAGTTCGACT GGCGTTAGCG AGGGTGGCGG G'/GGCGGAG:^ GAAGGGAGTA bA'O 

:"^:^';rGAA(:GG ta^gaggc^gt i(;:;:;GGG'rAG GGr;(.;-GG-v;GT Gr:;ArA g^'-'g G^T;rGA'rrGG 600 

GA.j;'ATGAGG GGv'c 'G7.3 '^7'G :^r; A]' G GTTGG GGAA::G:;c:Gr G:;TGGAG:iGA TGTTGoGGAT GGO 

G:r:;G(G.:G(:TG aggatag3gg ■Ti]-M:vvc:::GC ggggaT'g;cg gggggggjgg gatgggt.;gt 'g-'o 

G':;':;gt r<:.:AAG gg^xgvagggc; :\:g:; gg.:gag (;ggggg':v"-ga g';ggg ri g^-:a (g:(;g(v:;G'">:.;g 'jho 

^^.^r,c^KT('rc.r. r^GAGi::AAAGA tgcC':v:cggg gggggt<;:gaa G/\!;;(:^T3GG'..iG gc:a.^ggT'jGT B4o 

G'G'^'^AGT^TG GTC^AGC^GTGG AAAG-: j'^AGGG GGG'::i::GGC:AG T(G:;(^AGGA .v,. GGT:.G :GG:::AT 900 

GATTCTG':::': c;GG(-AGGi:;G':.: 7c;AG::rT(:;A': GAAGAACTiAr: crrG/^TGGG-G; ggggc;gcgaa 96 0 

G'::"":g!:gc7(; GGGAi.iTGi::cjG gggggaaaag gag(;gtaagg ttgtgtgAv...g G(:;gggagggg \02i) 

A' G:'GTTGA' :(■ GTri(GTG(;:JGG G'l'GAGGGGAG '"AGTGATATG GGO':;'!'; '( ;TGG C^J'GGTGAGGG :030 

^ 'OGG/:'*":': '/'.v; ; V3Gv G'/G ■■.;G0 G- J AGf'GvVG' ■ riV'.ACG' ' ' ;>0: .GG7<: ' G> ; GV( ! AA* 'AG 1 G'7 ' 
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i'l) SKC^UKNCK r'HA!-;A(:'l'F:Ki oT] CS : 

(A) LENGTIi- lObH r\isf^ fuiirs 

(B) TYPF, • nnciciL- acid 

(C) STf^tJOEDNESS : sinqie 

(D) TOPOLOGY: linear 



',:tc::'A(a.^gcc; ck^g :::;c;:'cc; ctAv.aA'"-;' act>;'-atcc^' - c-'Gr;n::Ti:;rA ggaattcggc 60 

AL(;AGGArG('^ GAi'^GIGG'^'AG ■'■;TTC;T'::G AA'"' (:(\'(\'G^'rrA' nr,hArT7\r(y:. v^TGGATGCGT 1.^0 

a;g''G(;ggga ggg-;^ :^aggg':' ■■v;gaatgg^''' ; vG:A;;TGAf:(^A ggggc^c^gaag ttgggg:;:^gg 18o 

GGGGCX^AG^:;;;; C:GAG'.:GGGG(.-- AAGGG^-GGGA l;TGAGGAiG";( ■ GGG::AC:TGA'r GGGCrAGGGTG '/.AC' 

ATGGAATi:AA r:GTr^GATTG(.; GGG'l'G' "iGGriG GGATTTC^AGA ATCGAGGTAC: TC;AGC :;CA7y\ 300 

^■GAATGAri:^G AAAACi:^GGCG GTGAGG7GGG GTGT'i G'l'GG T GGTGGTAGGT GGGTGGGTGG 3h0 

GGTTGTCiGGT ATGAGi.ATGl TGTTGC;;X:'GA AAi;^-GC;A7GG GGA'^GAAC^AG GCiTGT rGGG(3 4 2 0 

7:'^AGG(G:GAC^ G':;GGTGGGA:3 '."GGCiGGGG"^'^ G; 'GG^ 'GAGA':' ';AC;GGAGGGo GTTGATGGGA -IHfJ 

'7^AAG\(;GGG'r ga:":^'VvGGG1'' , (V^Gf^^TAG^';': ■; ;/iAG/Gv ' ,G ';a/\A' riv r;Ai 'Af/";'rTG("^ -mo 
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(xi) SEQUENCF: DESCRIPTION: SEQ ID NO : 1 5 : 

:;aattcggca cGAGAGGTr.A tcga::atc:at' cgggaccagc ccgacatcgt ciggaacaggc go 

i:jGCGG':":GGAG GGGC/rCCAC^C GGGG:X'GGGA rACX'GTGCAT C^AGATCGGCG TGGCTCGGi:;T 120 

GAT7^GAG-:'AG 'SAGA'i'GtjGGG TGGATAGCG:' C^:^GGAArjATG AC(::TAGGGGA TC^AA('^":T^(^A 180 

A:rr(;TGG I'TG aagatgaggc ggc;cgcaag:: gcggtac^cac: gggccgggga gg.aagacgca 2-\o 

A.^Xr':'(.;GAc;G GTTTGGc^GT'r gatti;c;':gc;; KY'\Tic::Grr tgctcggcga ggixgi'agcac; 3Gt) 

■G 'GGGi /:^"GA GG'i .:ggvG GT(r'^.;7A:■ : ■;'AGr:GGTGr'A '■:r.'\cy\K\'\''y' -GGC5GGCACC; 3»:g^ 

GCG.iAGTTAA KiGTTCGGC;': GC^ACGGGA/iG TGGGCGATCi.' GGCGGr^GAG^' TGATGC^ATGA A20 

(:CGTGG(^':'AG GGG':;TGGATG GGCX^AGTTG- ' GCGAC^GAAAC^ GTGGTGGGAG G'-GGG7AGGA ^80 

AG.:^GTr.:;:GTA GGG<;GGGG'rG ('■■TC;AtT^';GGT r":T^' •^-T(;rGG (:GT:-AG7GGG r:'TAGGGAGC ')'U) 

c;g '^12 

[2: I NGOIIMAG :0N VO?. GF:^ 1G fG")::*): 



GivvGi'irivi'. GriAiVv:'TK;-G ::g:;.'G : 
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GTT7\,CCGCr AATATVC.iGC ^.^(^Gr AC C (.]('( ^ hC^nCi^CCr^A,] GC..]CCiV:''.CJ\T TGCC GCCGGG 4 80 

':.AZCGf\AhCA ACACaICCAA.C GIJTGCGGCCG GCCCCGCCGT TTGCCGGCAT CACCGGGCAT b40 

TCACCGCCAG CACCGCCGTT 7VATGTTTATG AACCCGGTAC CGCCAGCGCG GCCGCTATTG 600 

|.:CGGGCGCCG GAGNGCGTGC CCGGCGGCGG GGCGAACGCC CAAAAGCCGG GGGTTGCCAC 660 

CGCCCCCGCC GGACGGACGG GTC:G(:GGGGA Ti-CCCCGGTT GGCGCCC;GrG CGGCGGGCAT 1?0 

tggtggtg!::t gaag.:ggtta (;:^'.]CC(.;(]'n'r ci](':;c(yi-i(:r GcrGGTGGGG gcnT':;ggcgg vro 

i:C'.c:ri:cGrr :;ttg "-.^gta-^ a :^ :GA(":v"GGr rG':r:'G(;(";G':G GTTGc:cc;G(7i ttcxcggcat 8^0 

TG=:"Lu< ,i:[/rr sggc;. ;( v.tt-'^ (:"xk:gc;t7g:' :'G:'' -G:'CAr(^ c;cv:Gr;NTTGG (VC;ccggggc^ 900 
:ggcgc;cg(:;g '::r;c -'^ 
:2) : r;i--oi<MA'rTC;:j i'^'^R :tKO i :.) UV':!': 

;i) :;K0UENG1: Ct!ARACTE:';l CTIGS : 

(A) Lr;iJGT[i : 1H72 t;.ase pa.: r - - 

[B) TVf^E: nuclojr- acid 
[(') ;";Tt<A.Mi;K[)K?:GS : sinqjt- 
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GACATTGAAC CGCTTGATCC AC/fTCCATCf ' :'(;C'AATCCAr^ CCi^GOT; ^ATT CGGGCGGGCC 7;?0 

CGTCGTCAAC GGCGTAGGAG AGGTGGTGGC; TATGAACACG GCGGt^GTGGfJ ATTVACTTCCA 7H0 

GGTGTGGCAG GGTGGGCAGG GATTCGCCAT TCCGATCGGG CAGGCGATGG GGATCGCGGG 84 0 

CCAAATCCGA TCGGGTGGGG GGTCACCCAG CGTTCATATC GGGCCTACCG GCTTCCTCGG 90 0 

C'lTGGGTGTT GTCGAGAACA ACGGCAJVCGG rX]CP,CGAG7C CAACCrG'Tr-G TCC;GAAGCGC 9G0 

tc::ggcggca agtgtc^(;GCA t.;7ccaC'':gg ■:x^aggtgatg A:.:ci:;ci:;GT^:'G acgggggtcg ]o?/^ 

[;atgaa;:tgc; cggaccx.:;gga tggcgc;ai:gg i";GTTAAGGGG gatgatcggg (;tc;aggtga7' iobo 

GT .■CJGTGAA'"^ TG^r'AAACCA A :nT '^r;^: ; <^A':''"-^ '^■■TAC'A C^TGAArGTGA ATTGGCGG/^ i]40 

G:^^AGGGCG(-i GG::rGAT?TG T :^GGGGATAG GAGC^^( ;GG(;G GGGGGGAA'^T GGATTGGGGG 1200 

ga;^:g^:;tgat t:;:gggggc;a Gi::GGi x.:gagi rG:;GT::7CG:G Gr:;cGGGTG.;; c:A7T:;TG::;A;i i.Gjo 

c;':A.^\rGAAGG AGGCAGAAGA GAC^GGTTGAG ■'"'ArG'GTGG'GG TC^'^Ai^GCG'AG ?TA::GTGGAA : 370 

GG::G:7r-:;TGG TGGAGGAIGG GGA':GC:AA7 GAGTTCGGGA GCGCCGC:'.]C GGr'^ZGGGCG .1380 

GA7GGGAGG7 GGTTT.V\:;GA GGGGC'V GYG 7AGGAGG7G''^ TGGTG'":G .777 G77V' T'G: .:iA',- J 4 0 

GGGAGGGGGG AC:GGTTri7GK GGATGTi'^rriT 7(7ArTGATi:7; ATGC7':':^Tl7-^A G'rAGGTG'7AG 1^)00 
TGGGTTGGGA 'TGGAGTGGAG GTGT7G^:GG7 7GTTGGTAG7 AG77AGC77'M- -G(77-^AGGGG 

GCH'TAGGAGA TTCCXXiAGTG GTA7AAG'7:'G 7TG!:7 GX^AAG ■I7:G77A( 7777 ( :7A( 7,AT'7'r(7 ] b;7) 
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(•?tcg::cgaa acctgatgcc (-aggaagagc; gtgttgcggt gaggc^gagg gggtgc.:(;ac:g b{j 

GGGGGJTCGT GGCGGAGA'l'G AGGCAGTGGG TTGATGCGAG AAAAGGGTTG AGGAGGGTGG 120 

ACGTAGCGGT CCGAACAACC GGGAAAGTCG ACAGGTTGCT GGGTATTACC AGTGCCGATG 180 

TCGACGTCCG GGCCMTCCG CTCGCGGCAA AGGGCGTATG CACCTACAAC GACGACCAGG 240 

GJ'GTC':CG'i'T TCGGGTAGAA t;GCGAGA/iGA TGTGGGTGAA AGTGTTGC^AC: GAGTGGAGCA .I'JCI 

ATCTCGC^GTG GATTTGTGAA (:TC;TGAAGT': CAGC^GGTGCT GGATGCTG^^G GGTGG :53TGA 3bO 

:'GGag-:tgct gtggggtgtc: A(;(^Ay\r:cTrf- aaggggaagg taccgaag'I'g ataga.:ggaa ^.^c) 

T7T G \G J AG C"iy\7iATC^7iGG Gv.)G.~i.v"t G\T . i >^ ■. . j', ^ . : '_" T' iL^'AAC^ATG '"TTGAT ^GT''t >Ul 

■:;r-;:^:Vv.AGAG 7ggaagG'':gg gggagc^gtgt ggattgc^'Ga GGACGGr^r^x^ :ag(:a-"'gtcg ^).;c 

gg-;.ja;ggag gatgga^:gtg ggatgc::^ggt ggattcaggt c:agggagtgg a/iatg ;aacg b'.c 

AA'.:GG' .TGAA G:;TGGA:TAr; GGGGAAGTTG CGTGGA'':i";GG '"T ^ ":TC:' ^AAA :GGf:C':'TGTG 6^)0 

.W'::GGTG'rGA agggca:ggg aaaagt^'^-agg cggi-gagggc atgtgaaaat tc;acgggcta 7.:o 

GAGGG' .(:;GGG TTCiGTGGTTA TTGTTGGG'l'G '-TTrvXyif 'T^ ( ITi-y-K ;Ai;G^."'G GCGGA-";GTCG n<0 

':-GGT';; ■ TTGA GGCr;GTAGC:'I CVrrccCTr'Tr. Ar;GGf 'GA^'^GA GTTGAGCATG GTGr;AGGAGG 8-10 

GGiTrC.'.ATGA 'rGGGGGGAGG AAGiJAGGTCX.; TGt^CGGGCGA AAAGGTCGGC^ * :CJ\r.C[)Crrr, 900 
AAi.iC-'.''. TTA7 I'GGAGGT'GA' ■ ■AGGAA:-:G'rG ^"'rr,::~ ]r'\ r A< 'rGCA.; AL,o A GArX :Ai ;GTGG 

.■\mv.AA' -Av ;gV :GGGGv 'G^' ; ' ; G: ^ G " AA G ' ' . -'"-GGiGTAAG ■ 7 - v ^TT '7' -G ' ' AA-'GA^ 'AG^:' VAG^ 
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(H) TrFOI.OGY: 1. noar 



(XI) sequence: ORSCRTPTION: SKO in NO: 19: 

oAATTCGGCA CGAGC^GGCQ ATAGCTTcrpf; :;gcc:gc :;gcc gaccagatg:^ ctcgagggtt go 

::gtggt::ggc; ggcag ..xiccx:; (;gcg.:agcag ;g :'gac.:g(st gaggc;gctgg aagacgccga 120 

c:)G7cr\Z7CG t?:^gt>:g:^g a'jgg::a^::c/Vi gx::ggC';';gtg g7tgcctac(j a.cgcggcctt ]^jo 

;::x:g7a:gaa at^gc/^taga TCGrr;GAAAc; :::G':;agtgggg aggatgtg:g (;gcagaagcc ?.''a)' 

G..Ai.AAjA'rG TT.:i"i' .:i'AGA T JAG .:gT'.:ta gaaggaijgcc; j'acgtgga:;g gggggc^aggc 3;o 

G':;AGAAGTTG GATGCGGAGG GGGT^.^CIGGG GGGTATCTAG GX^NTAGGAGG GGGGCAGCGA 3f;i) 

GGAAGC^GAGG AAGAAGGr]-:;G AGATGCTGGC: GTCGGCiC^Gl'A t:;GGATGGGGG "G :;GG ::TGGG 4;'!) 

gggaggagag ATGGri]G,j:(] Gv:gagtgc;ga g.^ :'ggg(g.;g(^ GAGGiGiGni' gg^ti:;aggag ^.ho 

TTCiC^GGGGAG CTAAACGGGG A'":GCG ^GTCGH' ■^Yi7'GGA(^ACG GPiGAI\GC7CC GGChCCCCGh 

tggggg(;ggg cG:GGTGc:Gr^T Av:gtgag(";ag aggggt(;ga(; aa^'GGTGGGc; ggi:gc;i;;tgat coo 

GGGGGTC^TGC; GAGTGGAT(-;G GGGvG;(;Tr: ^ '^'^/vCGAGATG r:GAGGGTGG("; TGGt'-GCiGGAG TOO 

A'"A' 'G7TA' 'rTGi^G' 'A: "( "<"■; AGG(^GTTG:G^ GTGTTGGGA*'" AGTG;:.;GGG'^G ( G"^GGGG' jT'"'G V/O 

':":a':tt':aa'' rr-^G/^^'G:^' aaG'^ggag g g ;GTGG::';:':r ggtgggag-'^; (g:g ';GrG:^GGG vr'' 

[iG^Gr "w/.^G'^G /GG /\ .\G ;A: GAG'GGC^G TG ' ' ' ' .";GG'""G'; ;i ; :rr(:<:('.' ■ A G"I AG"^^' H ( ^ 



- V'l r:G :i i-G : ■. ;i 
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CAGATTCAi'A ACGAATI'CAC: Ac;CC;G(;AL'AJ\ C/V^TATUTLX; CGhTCG-ZG'S? TTATTi'CGAC: 1,^0 

AGCGAAGACC TGGCGCAGTT GGCGAAGCAT TTTTACAGCC AAGCGGTGGA GGAACGAAAC 1 HO 

CATGCAATGA TGCTCGTGCA ACACCTGCTC GACCGGGACC TTCGTGTCGA AATTCCCGGC 240 

GTAGACACGG 'I'GCGAAACCA GTTCGACAGA CCCCGCGAGG CACTGGCGCl' GGCGGI'CGAT JOO 

CAGGAACGCA CAGTCACCGA CCAGGTCGGT GGGGT3ACA(; CGGTGGCGCG GC;AGGAGG3C 3f:0 

gatt?c:tgg gcc;agcagt7 CAT-:"x:AGr(;G ttc?t:;::ac(} aacagai'CGa agagctggl.:g -120 

ttga7g:;G/W^ GCc:TG::;rciGG ^GrrGCGGAT ( :ggc;c:cggg;.; ggaac:gtgtt CGAGcrA':;AG 4 80 

aa:tt"gt-':g ::acgt3aagt ■:,< ,Ari, rGG:'G cx:gggC':x:at gagggggggc g(:ac:g:t<v::c b-]n 

gc;ggg::cg:c t::tagat3C': rc;n.-;GGGGAT gagggagtg:; ggggmtgggg ccGcc::c'?::'f goo 

tg:ag::ja.:;g jgttggt^g:; ■;;i:G':;3ggtG':; tgagtaggaa tgcaggggag 'a.'cgacctgc goo 

GGGNA.AAAGI ::gatgtG'::tg GTA':;rcATGv7 ag:^ttggaG''^ agtagaggg'O gggc;g:ctga i:]0 

GCTGGGGAGC GGTGA^AG^CJAG TTGr:GGATAT TGGTTTAAG':^ GAGG^TAGTGA GGr.TG:GAGG 7 HO 

GGGGTT-GGGG G' ;aggg :ggt :;(;gc:G':^agti:j gtgg':gaggt atggggg:^;:;? r7Tc;':GGAG^ &^,n 

AAGAAGGTGG G;:Ai';o;AGGGC^ T(;(;;/^GG0GGG GGGATGGGGA GAGCC^GG ^i^C; Or^^AAAAGGA 900 

GATGAAGAGG GCr\GGGGrJ:r. GATC'^G'llGGn GGGGGGTGG(] GGAATAG'^c;/, AGi,;c;gTGT AG 9 00 

GAGGG(;l*AGC: A'yY'VGTT'VT': ' "GAGrAGOGA A? iGG TG r':'' ; ' :GG^Y/] ^'0";g:: G(v'-Gr:T':AAG ](G"^ 
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'vccGCCcc^G ANG'rcx.:."A;Ac: cGca^rcc/\(\ -:ggacci\(]k:: tg:"'tcaac'Al; nctogncgat 

CCCAACGTGT CGTTTGNGAA C"AAGC;GNAGT r7GGTGfnAGr; (;\^ ;GNATr.\0(; NvGGNANiTGAC; 300 
GGNGNGNATG GNCGANCACA A 3 21 

[2] INFORMATION FOR SEQ ID NO: 22: 

( i) SEQUFNGF. GHAP AGTRFM STT CS : 

(A) LFNGTfi; 3/3 La.^e pairs 

(B) TYPE.: nucleic acid 
(C ) STRAN1)KCNES5; : s ir:q.] c 
(0) TOrC'LOGY: 1 : near 



[XI] :a-:,)0'r.:tJCF : 'f.G j;< i FT I ON : S : i) N'J: 2 2 : 

TGTTA7GGGT TGGGGTT0C;G A ::GGGT'r I T GC;GNGGGGGT C;GTT7VAGGG(; GTGOGCCAGG 60 

GGATGGAGGG GGGGGGAGAG '^TGGAGTG::; AGAG'l'GGC-CG GGG:;GTi3GAG G2G::AGG2GG ]2(J 

ci:tgggtggt gnagggg::aa ggg(;tgaa:^g aggggttgna {,ag^gggatg aaG'^co^attg iho 

AGGGGATGAG >"r:GGA7C0^GG GGGGGG'-^AOG GGGA.iGTGAT rA7":GGGGAG GGGAAGAGGG 240 

GGAA/GiA-0(X; ■ O^GTGTGTCGI' C^GGAGAGGAT ::^(^':'GAAAGGA GGCOIGAAGAA GTGGGAGTCG 3 00 

GG'I'G.GATOG:' AAGAAGGA :G "'GGGGITG': ;, TATA -GTTGG ■'■In I r (:<■:(;■- AGAAGGGGAA 3(u) 
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G'XVVGGAGGT TTTOc ;tca(^(.^ g'x:a(]GC(:tg (;naac;tggc;a cac^:;:tgca;:: gccatngngt soo 

TTGACGAAGA NCCATATCGG Ni.iATTiXGNC AGATNGGAAG TTGC(nAMGGA GA 3;)2 
(2) INFORMATION FOR SEQ ID NO: 24: 

(I) SEQUENCE CHARAt^TERISTICS: 

(A) LENGTH: 7:6 bc^so pairs 

(B) TYPP,: nucloic acid 

(C) 5-TR/iNL)KDNF.fAS : sj ng.le 

(D) ^'OPOLOGY: i incai 



(>:;) oKQUPlNGK Drir.GR H •? ] G N : :GGj H.) NG : : 

gaaatj;;ggg 'rTGAT'rc:G(;T 'rc.v;AG7AGGG ggtgggc;ata atggaggaai.^ tgatg/iAGcg bo 

GCr;G7TC'V:.r G^GGTGATGG GTGAr:AGCX^A r;:AA';GYG:GA AGTTGTGTGi; TATATGGCAC 12J 

GTAGG:^T:CA GTTG^TTGG:; AGATGGGTTG GGTAGGGTCA ^rCGCATGTAG G:7GTTCGCGT 18] 

gg*.:g;ja:.:';'"T gaiggg'GGG ; GvGG>.;gatgg g 'GggaG'..;gg 'iggI'SGGgggi" gtgggggtcg 2^(} 

GGGGGGAivr^^ CGG:A(r'GGAA AG :;rG;GGGG TGGGGGAGTA GTAGTGGTGC'^ GGGGGGGAGG 300 

^-TGTCG^A^'f^C^ ^GP^ATC;r;G:7:; rG\--AA'"Tr;r::;; AT'-G^' ^TAGAG ^^TG^G'ATGA^- r:AGTTGCAGG 3 6n 

' G"G -A'^'"' ^ - g'*g';'VvT';ag ^;G^A':'A■■^■" ; ta'"Aaagg:'G g- iGGi'c;'.;; ■ "y..-..;-' 'AAGgta '>-u; 
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(XI) SEQUKNTP: DKSr KIPTJON: 5n-:C) in NO:2'j: 

CGCGACGACG ACGAACGTCG GGCCCACCAC CGCCTATGCG TTGATGCAGG CGACCGGGAT 60 

GGTCGCCGAC CATATCCAAG CATGCTGGGT GCCCACTGAG GGAGCTTTTG ACCAGCCGGG 120 

CTGi:CCGATG GCGGCCCGGT GAAGTCATTG C:GrCGGGGCT TGTGCACGTG ATGAACCCGA 18 0 

ATA'.rGGAAGA ATAGGGGi^GT GATTTGGGAG TTCAATGTCG "^TATGGCTC; GAAATGCAAT 2A0 

GGGGGGGGA7 GCTLT:;GC:jt: .;A::(;A(;G::rr (;::ot.:AGGc:Gc; c; j::ac;ccx:ga A':G'rGi,;AGGG :K'0 

AGCAC7TAAT GGC(^c;GGAT AAGCCCGG-'^A CCGGCGACGG T^CTTTGGAA (^CAACTAAGG 360 

agG(;:;c;G':(^g '^attctga'!':; ggagtacgac ttgag:;gtgg gggtc^gcctc^ gtc:gtcgagc: a2Q 

■rGACACC'A-^A GGAAGGCt^Gi- G'.'AC'i'GGG'. (■; AGGAAG'IXWA AGGGfJT jVU "i' AG(;t;vAGACC: 4 80 

agg(:{:;aaggg ggaatggtgi; gcgttagg;.:(^ cacaggttgc ggtagatgtc gagtgtctgc b^iC, 

TCGGGGATGT ATG(^GGA(;GA i.^AACrCTTG-"^ ATACAGGGC: b-8 0 

(2) INFORMATTON r\'R SK(.) ID NO: 2b; 

SK(.)Uf:NCK CHAIlACTERirn'IGS : 
(A) LENGTFi. 1 GO ba.so r:ai:.-s 
(li) TYPF : riucJ eic ac: (.1 

(G) rTRAr;r)::DNE5^5-; : -iruro 

; n) 7()PGL,();^y : l i near 
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(ki) SEQUEliCK DESCniP'TlOK: r>KQ ID NO: 27: 

GACACCGATA CGATGGTGAT G'l'ACGCCAAC GTTGTCGACA CGCTCGAGGC GTTCACGATC 60 

CAGCGCACAC CCGACGGCGT GACCATCGGC GATGCGGCCC CGTTCGCGGA GGCGGCTGCC 120 

MGGCGATGG GA/iTCGACAA GCTGCGGGTA ATTCATACCG GAATGGACCC CGTCGTCGCT i80 

GAACGCGAAG AGTGGGAGGA CGGCAAGA/iC ACGTTGGCC^T TGGGGCCCGG TGTCGTTGTC 240 

GCCTACGAGG c;vGAA';:gtaca GA(:c7iA-:x.;cc: gc; 272 

(2) INFORMATION FOR .SKQ 1 [) m:2b: 

{ I ) 3h]QuFA^ ".K GtiA^/U : 1 h,!-: 1 .•;T] GG : 

(A) L[::NGTH: 317 bd.::e pai i ^, 
GO T'lPK: nacle.ic acici 
( G } 5; : RANDE DNEGG : : > i n q 1 e 
(2) T-GPOLOGY: linear 



(k\) GEC'UENGF DEGCRI ["'TIGN: SKO I [) NO:28: 
G;'"'AG<-^'-GGTG GT7GTGGGAG TATGTGGC^GA G-GTGAGGGA GGGGGAGGTC; C^^^GC^AGGTGA 
AGGGGATGGA r.GAGAGGGAT CC;CrTGr::G': GGTTGATGGG TTAGGTCGGV' G'^'TATGAGCG 



;':ga'2g 



GAGG'. ■ 



GG'^AGGAGGT GAAGCITGGGC: GAAG(^:;GGG.: ' 
GTGGG:^ATGT .■GG^■GIG;G""^f" GAGA 'G::;T^-;' ATGGGGGAGA 7 '/G GG'[ G" ' G/GT^ 
vG.;A7\T':"LGAG (.G.;Ci7AAGA2G AAC;AAi:iG( .GG r AAA^ ^A'l'f "GA ' •GT^:GTGGA( ' AGVGC 



7ATG(^ 



1 ;U 



■ . i r r . : r, ; G t. 
liTivGGG -GGGG:': 
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ocAGCGCcx:;:; Ac^GAC(^TGc;r cgCiTGGg;:a.; (.^atggtga'I'g a;\:agg':'Gl;^; G.-rGi.-;-; .:cag 120 

C:GCTTCGGGC' GC^'^CTArl^AA AC:AC:G{;( '(^AC AGCGTGG'GCG GGGGCGCCGG AG(;GG(jCGG7' ]H0 

GG 1 2 

(2) INFORMATION t'OK. SE:Q IU NO: 30: 

(i) .SFQIJENCK CHAKjACTKRT^^TICS: 

(A) LENGTH; 308 ba.-e fjair;; 

TYPE: nuclej.c cicid 
(C) GTRANDEf'NFGS : rina U' 
ICH'OLUGV: linear 



'^^TrGX:G^^^'] TTr(^GTGA''7C A3:7rGGT:GA GGCGAAAGTC TG:^GCGCGTG GGAA';^G:^GGT o.) 

Gi^;CGTTCAG GAGGGGAAGA »A^: IXGTGTC : v^GAGGTGGTC; CC^GCTCGIGT AGGGr:GG;^. i:^i) 

GAGGTTGAGA TTGGGG;;GGG GG'SGCiAG aX, GTAC;CAAAGG TTGTGCGGi^T GGATGGT::aT IBO 

GAGACTGGG(^ C"";GT TA."-;iG(\^T 'r(:;AGGATv^GG iV?G'lACC'^CG TGGCGGAGGA TTTG("^AGGGT 2^10 

ggc.;t7G7gac; agcaggtgct GGAAGG::rTT cAGG' •G'r':v\A GGGGTAGGTG a?G(:;agac:cg 
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(I) SKQ[lt:NCl': C}IA[V\CTI::RISTIC:: : 

(A) LF-NGTH: banc priir.-. 

(B) TYPE: nucleic acid 

(C) STFANUh^DNESS : single 

(D) rOPO:.OGY: linear 



CT:^G7v.;:rA:;A aagaatgtga ggc;gagacga tgagcaagga cacctaggga gt:;atcgaca (jO 

tggtcg^^ga;: (^■]''"g;'G^gc ac^ :";g;:g t::gacg ggggaatgga :,i;ggc ;gtct: ; g::'cgagctg ig'o 

G;;GAc;A:GAr GCG.:^':;G :;'..tg ^AGTr;.";TTGG aaggagagtc^ aatg^gagg-; ga ;x:tggtcC' iho 

ag:;gag:ggt ggggca::ttg caggt^'^a^ta g^^aaagtcgg GTTCGGCT:iG ac^^attgctc; 2^10 

AAGCTTGAAG CGOGG^'^GAT AAGTGAGGTG GATC:A'I"rAAG CGAGTT7TGG AG.AAGATGG1' 30 C 

GAGGCGCTGG AfiArcrr.c^T CAGGGGAGC^G TGGC'ICCGCC GAGGCGGTlK: GTGGAAAATG 360 

^:^^^':gggagaa tgcgt<:'.^ggg gg-iGGtagaa gg;-;iGGgggg gg'g.:^j\tt(\'; T'rGG(;TATCT 4 go 

GGGGGAGG7G rGT(nGG<3TGr^ AGGCG >( ;AGG/v AGCr if ,'■'( K ;'['' ■ GAGCTr'GGGT ( ""GCi'/TGGGC-; A>yi} 

Gr':;GATGG(G; GGGGrC'-'TGA (^GGGGTATGG C;AA.-^AG(G;A(^ 0(:A' GG^TA'/G GGC;(;GTTG(7A '''-V' 

■ ^' ! AGGG'^GA'.i GTGG "GGA^V-. T:''GA- 'AA"'i ■ " - 'T G/^'Gw-. ' ■ ' G^ ^/\.-.AA' : /.G ! A' 'Al ■ i'l' ot.c 

A^'GAGA^'TTG ;'~r;r;'r'rr- 'f'AT A':^- G;G/\T( ;< " ■ G < :'Gr:":;/G GG*. iGGGVr G- ' '1 G A'G 'GGGGG 7 >' n 



wo 98/16645 



S3 



PCT/US97/18214 



Ac;(:c7c,(:f;;'i'G ATCGf;Tc;ATC act:aa^g:";tg acagcagcgg G7'TGTGGA':-r agggcgaagc; 1320 
cca.:ccgi;;gt ctC':gggtct gtccagcgga tggagggggg c;aaggggaga tgacgaaacc 138 0 

CCGGCATCAC GTTGCCGATC GGCATACCGT GATAGCCA/^G ATGAAAATTT AAGGGCACCA 14 4 0 

ATA<.\ATTTCG ATCCGGCAGA AGTTGCGc^rC GGTTGCGGGT CAGGCCCGTG ACCAGCTCCC 1500 

GCGACAAGAA GCGVATG^GG TGGATCTf a;*:: GTCGTGGCG 153 9 

{? : rNFORMATiO!J F'.ni GKU IK ^JG:33: 

[1] ^^KO'JUICE GtiAKAGTKKTSTl CS : 
(A) LKKGTil : ha^;^^ pair 

(C) f/l'RANOKDtJKS^: : :;jriolo 

(D) 7G[0[/")GY . 1 1 ruMi 



(XI ) S'KCUJF.NCK [Jj'SGKI PTION^ : GKQ 10 rJO:"V-i: 

GTGGAG'^GTC; r;rT;'I GGATGA GCAVrCAGC G--; GGi ;G:'AGGr '( ' GAGGTGAGCG GCGCGCAGGT 

GGGGC;TTGCT GCGC-CGGCCT A(X^AC;AGGGr cn'ATGGCGTG AGC ;( ri'GGf CGCGG GAT 

G-.^GGc;A.;AAG G(-;'iC Ci'C.AAG 'l'GArc;A'rT' "F GATA'^GGA^'^' AAG^"'T-'T7r/'; r;G(:AAAAGAG 

f'';'GGr;G "^ATG G^ r^'AA' ; AfiG^TGAATA GGGr>:.AGAT^; TG' G :rx-^:^ AAG AGGGGGGGGG 



GO 
1 2 (J 
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( i ) s E QU E NC r I { A hj\c;t k f ^ ] s T ] c; S ; 

(A) LENGTH: 25^ tjase pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



r;ATCc;ATc:G:; (^irxv" ";aaattt (^c;accagatt ccxxn'CCo'n^ c;ataac(:caa Tt^AATrGAAC GO 

CTACiATTTAT tccg'tcca:?.] :;c\ccr.Gh^.]7i\ AT(;r;cTc:/:A : :^;AC'AC^■;AA" cttactgctg 120 

c:GGGrACC'!'c^ rc(;i'Ac;Gi'c;r :r:;ATAc'' ;g'' ggaagg.:'J':c .^acatt'i tg'" aggga^'^aggg \so 

r:rATGGAAAr; gtt'ggagg(;g gac7'ggagg': tc^tgal ;G''^ag :g:gagggagt ggc:-aggctgg ?ao 

GGTTGr;TG.AA GATG 2b'] 

(2) INGORMATIGrJ hGG< SiG^ IG [G ) : 3 S : 

(I ) Gr:GwF:NGr: G;iAKA-^Tr::-;i:.;'riGG : 

( a; GENGTfl ; ] 22 V ba::^.■ : s 

(H) 'I'YI"']-G nucJo:^: cjcki 

; iG '['(G'f ):/)GV : : i ri'^a ; 
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CC.CTGCTCAG C;TT;:nCrAA:^ GC::TGATC. AGCGGTTCr;'' : CrGCArXGCG TGGTGGATAC 6 DO 

GGCACAGCGC ATTGCGMCC; ATGCTGTGGA CATGGGGGTT G'l GCAGCGCG TTGAGGTATG tni) 

CCTGAATCGC GGTTTTGGCC GGTCCCTCCG AGAATGTGGG TGCCGTGTTG GCTCCGTTGG 7 20 

TGCGGACCCC GTATATGATC GGCGCCGTCA TAGCCGAGAC CAGGGCGAGG GCTACCACAA 7 80 

TGGGGATCAG GACiGC :^:TTG 7GGGGTCGCT TCGGGTAGGA GAGGTGGGGG GGGAGGGCGG 8-^0 

GATAI'GGGG'-' JGGC : :'AGG -CGG^X;? :^GT CT'G'V-fV^TG^: rc^:;::;c;CGAA(T GGCGGTT::GG 9-"'0 

Gf-GGGGGGAG GGGG7:^GGGC- I'AGG GGA SCiC; ^'GTGG'JGGTG 'G'i' ^iAGGAG GGGTGGCj'jGT 9^0 

r,('(^::rc:rrrr: t ^^GCGJ'T '-GGG GGG\GAG"'GG GGTTGGGGGA G'l' .iG ..GA'""'"G i G^^TATTGrGG ii.J.-:u 

TTGTGGTAGt; GTGCri'i.G,A':;G ";G AfGGAG'^TG GTAr;GGGGAf: AA :^:':-;CX:G' .;T GGGGT'"^AGCG lOHO 

G(;GA(;GATC:G GC^ATiy-y^GT GAlXM^GGGTA gg(^ac;ggtag CGGA.'\GAGGT GGGGTGAl^GT 1 1 K) 

G'rGA/vGGGc;A ci':ggc:c-'-g<:;c i':gcggcgccg ataatg-i'tga aac^agtaggg aaccttagga i ^do 

ACGAAGGAGG GAG ATT'I T'^^T CIAGGAT':^ 12. '7 

(/; INFOKMATION yO]{ GKG ID GL'::H.: 

; ; ; Gl-:GGF:NGK GrlAIi;\GTr:Rl GG^GG : 

[A) GKNl-'I'H : 1 e 1 bci pa i >.-^ 
( V I 'i^YPr : nu< 'Icm r ,if^i ri 
('^' G'rRAN'VaVJRG:' : .;i:vj.:^' 
( ^ TGi^( G.OGV : ; ri'.-a i 



G . 
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{XI ) sequence: DESCRIPTI'JN: 3EQ ID NO : 37 : 

GCGGTGTCGG CGGATCCGGC GGGTGGTTGA ACGGCAACGG CGGTGTCGGC GGCCGGGGCG 60 

GCGACGGCGT CTTTGCCGGT GCCGGCGGCC AGGGCGGCCT CGGT3GGCAG GGCGGCAATG 120 

GCGGCGGCTC GACCGGCGi';C AACGGCC;GTC TTCGCCi'AlGC GGGCGGTGGG c;GA(:^GCAACG 18 0 

g(x:gc;gal:gg ^ggct':':^^^!' gc;gaacg(;cg gtaagggi'Gc; ccagggc:gc;n attggcggcc; 2A0 

GCArrOAGAG ■XCGAG'A A^G A TCGC;tA:;G7G ACCA;AGGAGA CGGGGC/IGAG 20f; 

(2) infgrmation y<rr< skq in n'AtJH: 
(J) lUvjUENC!.: (AiAkactkh: st:cs : 

(A) LKUGTH: ^-l bri:.e pans 

TYVL. riucj.cic licid 
(A) GTPAi;[)EL)NE5AS : .>inq] c 
(A) AO!A)I,C>GY: linear 



1X1 ) seguencf: Ai'.sAi^ ; i^r Kjr. : sko : : 
GATAAACArGA GATGG^;(■ A^r A^ ataagt(;gaa AAA': 
;;a J uy: m^AA-y^ou AfA- aev "i e ea; :.a.) : 

AM TYfA- : A •: i ■ ■ ^i. : i (■ 



[A.): Ui : 
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(A) ]-KN[.;TH : ■) < bds*^ p.i : 

{'r.) STF<ANf)KDNi-::;S ; :iinqi*^ 
C)) T:)rOLOGY: linear 



AT(;(;(:c;Trc:A r{]GG(;cGCC(^ Gr;(;AC(x;r;C'" Ac;cT^^r'r:N(;G c;c;rc(';0GC(-n tc^g 

C^) INFO[<MA^MON P'OH I^IvO ID TJO : ^ 1 : 

(i) 5:ECUP:NGK CflAKACTKRTS'r^TG^ • 

(A) Ii:^G';i: : : 3.' pa i rr. 

(B) TYVK\ nucleic: acid 

{ [') TOiOLOGY: ] i riea r 



iCAi) SRODKNCK [}[• ;;C1<: FTION : SX'; 

(;A'r(.:GA(:c(;G cccrrGCAC^AG ggi'^ggg; ;''g ■ag'G-:-; ga* '^^aG' 'G' Gr;rAAGGGCG 60 

Gr^Ari'TM^cru; c AAr'r,c;G>:rY: AAGGG:^:Ar'Gn ■"r(]': : ■ 'W-cc(]C]c::-;:c\ G'^tggc^ggga : ;g: 

AGr;f,;-(;(/-AA re, i ^P 
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I A) I,F:M*.;':'}J : 70? i^aso i>H ] I 5 

(H) TYPl-;: imcJt,^ic- acid 

(C) STFANDLLiNEl-^S ; .single 

(D) TOrOLC)GY: linear 



C.AA;CAtx;/UTC-; MCGLVlhrcr r.C(X]-~](:A'TC(] "A'ACC/TilCCG I\T'YCG::CGGG T'TTCCGCACG (>() 

r;.;AGGAAAGC CGC'!7^C:;AGA TGGGGCTGGG. GAAG'Yl\GG(\G GATGCGTTCG CGATG .:CGGi: ] 20 

A"':;AAGGGGC' GCiCATCAAAI -T^AGTi^GAGGA A''X'GTrGAGr l"rAG'":GAGGA GAATGGGTA]' 180 

AGv'A'.Gl'AAGG AGGA'l'GATG GA'rAGC^A^ 'G' ' AG'rCG.G:AG;r"" CGGG;AG(;C-;VG C-'A 'i'CA! tCAA ' j ^40 

ac^attig^jAA gaggck^caag iGAc;(r;GGAG(' cr'GCGA7C;r";c g:;a(x:caccc_; actga':'gtC': 'M)o 

GGIVYGAGACC [VVGGGAPiCTG ACCG;nG(;N77V A/IAAGGGG! G;:;AAC:AGNrr; G7'NTTGTCCG 3bC 

GCGA(:^AA;^A':' iG^"GGr.AA':Ai;- C:TGGCX;r;GGr: GT;.G:rA;i/G;A r;(:GGGAGGGT GTG(^Gr;ACCT 4 GO 

: 'g:-I(g:ggaa 'Gggl;c-cg::aag GrhGTATGG^^; A-.i.":":TGA?GA gc:aggct(.;gg AG^GCGGTGG 'UiO 

A'^'AAG^;ACG^G ' ^GAA'' .GAAGT GTGG/GiGGAG /w'i A. GiG', :(.';■. ; ( ;GG'. ' G'i'GGG A GG(G(.7Af 'AG'IG' '^^A) 

a;G;GG;-;aA: AA^ A'i'.AGACi; ( 'GGAGC;i ;'ri :f ; ^GGA' :r;: 'GGG '!"GAA( :7GAA^ " GG' 7G'C-,r^ATf: hfjO 

GGAAAGAA: ^''G^ ^ AAGGAAG AGG 7G\Af;^^' " ^- G.:"A' ^^"AA A : ^:G"AG'AG G ■ ' ^:";g "'A'""GG:G-: G(-,G 
A^:"^AGA'::'G : GAAGA 'G'lG^^ A' G ' ' ] GA' " : ' GGG\A^'-;'G; 
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CTGGCCCC.cc i^r':;(jc7^'i\,:H.; aa7x;cv:oATC^ r,rr(]:r(;(' ATf'A(;:;GAv:A ac^:^(.;ggci.;cc: 2']0 

AAGTCCAAGG GTYCTCAGCA GGAAGACC^AG GGGCTGTACA CGl^AGGATCC TCGTGCCG 29S 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUKNCF CHARACTERISTICS : 

(A) LENGTH: 1 OSB base paxrs 

(B) ryPF: uucieic acid 

(C) GTf^NDKDNK.^S : si nqle 
([)) TOPOLOGY: Linear 



:xii :.;[::QGKnCK GrG;CKi pt:on : ppp ':p Mj:^::,: 

g:;ggagc;agg AP(":c;AA'rGG: ' c;tgg:'ggg;";a gga:7\gi;gtg ggagtgca;'^' A';tggagga(^ 

:'CATGAc:ta ctgc^cggggt AA(x:GCGGAr agggggaagc gcaggcggga gc;ctcctagg i;^o 

-.;agg'.:gt JAG ac:ggtcgttg G(.:gga(":gg< •■. ; atga(;ggtgg gagcaagcta cggatgtagc iho 

VGhhrr^'rr^f^ GCrr^GGGAG'G; CTCGGT^^'GC; GTOGGTACT'I' cggcagcttp GGGGCAATGG '/■]() 

rcACCGTL-Ac; TA(':(:GAA(::Tr; OGGGGCGGPrG atgc;g(;cagt gti:ggc;tgag AcrGGr;c:TGG 3i)(; 

/gc;gggc;ggt gggtggggtg G^ t^-gggggg gtggc{;Gg:g' GrprTcroG'^G ^T^TAAriGcrA :gp) 



■'■■PA'VA 



wo 98/16645 PCT/US97/18214 

(2) : N'tXlKMATI ON FOR SKO J 1) NO : ^ : 

(1) SEQUKNCK ClihliI^CTh:RlSTlCS: 

(A) LENGTH: 327 ba.sp pans 
(D) TYPE: nucleic acid 

(C) STRANDEDNESG: single 

(D) TOrOLOGY: linear 



(XI) :-;i-:0[Jr:NCK n}-:SOKIPTiON : SF:o ll^ rjf..):4b: 

:'c;o'jAcoAc;/i C:;Ac:rGATG^'o r/nViC'^oToo Gr;r:A(;c;Aooo agotaatt^'^' gagcggat't 6 0 

:cggc:gaggt gaaaagcgac^ atc(;a:gal;g ^iGGAcrroGAo Gc^GAGCTTor; 'j'tggagc:;ggg 120 

agtgc;g(";:g:;g ggggg ::c;(;gg agg(";g:"gg. /.::g''gggg:-g (■'..;':G■;^;^;'^■r^^■ gaaga^^ggag : ^^c; 

';C7VATAAC;CA GAAGGAGGAA :""rG(;Ai.:i';Ai-iA TGT'G^JAGGAA TATTGC^I'CAG gggggcgtgc 2-^0 

aatagtcgac; ggg^^gaggai- c;agc:aggagg aggg;;gtg':g gtgggaaatc: v;ggotgtgag 300 

iX'GGTAATAC: GAAAAGAAAC^ GC;AC;c:AA 3;' 7 



0' ! :^:F0F^^LATI("U'J V')R SFO :G N0:47: 

(I. Gi-;oGh:r^GE ghafagtkr: ggigg : 

(A) GENGTii : 1 r j.i i- 

(;"0 GTHANGGnGiGG.; : :;i[i:0'^ 
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(xi) 3E:JUI::n:F. description; SEO id NO: 48: 

GATCCGGCGG CACGGGGGGT GCCGGCGGCA GCACCGCTGG CGCTGGCGGC AACGGCGGGG 60 

GCGGG<_;GTGG GGGCGGAACC GGTGGGTTGC TGTTGGGCA;^ CGGCGGTGCC GGCGGGCACG 120 

GGGCCGT 12'/ 

(?) : ::i-oKr:A'riON !0'>r si'C^ ir- 

; 1 ; ijVAjVKUrK r[lAKAc:TKRr:;T]GS: 
(A! J.KNGTfi; Rl i^a;^c pairs 

GTKANOKDNKSS : r^malo 

':'or.:n,OGY: : i noai 



;>;]; :■ t\)Jr:Nf.'b. :)r::;cK 1 PT I Or: : :>h:v in no:4\^: 

(P) I rJi''(.)KKA'rU;N i-'c/K PK'.- "IP KO:P(;; 



;g(^-. ag(;gcggcaa 



HI 
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(xi) SEQUENCE DESCRIPTION: SKQ If) NO: Si: 
CGGCACGAGA TCACACCTAC CGAGT(^,ATCG AGArCGTC(;G GAGCTC(XTr GACX^GTGTCG GO 

AGGCX^GNAAi^ rcA-::;cA^c:Gc;T ■::':( ;ggc::v;ag - "i c;cscAGyu: ga'1'Gc:ccv;^\; gtc;c;actggi' 
T'CGAAGTACA c; TCAA7 ;a ";r;: -■■^aC'"7g;.; t^ga'-g^ai Grrrrc;{:c;cAG ttccaggtga ]H() 
:'TAT(;AAAGr cr;G ':ttcc;:k: ::tggaggatg ggi'ciaaggtt CAAC^cG^riGc ggataactga 24:) 

IGTGCATCAT TAAGCC;ACTT ?TG'Y.GAA' 'A 7GCT:.;AG'G(:: , CTGG/XAAGGG GGTT^.'AGCCG iOO 

\(^C\C]T('\(V:TC CwCiXiACiGGC: i :7( K'C7 .'AA AA7G(^'77fn77 A:" AA7"7"G 7 7 GGGG'^ 

[2) INFORMATION VO?. i^KO ID N0:b2: 

( 1 ) SEOGENCK GHAl<.-\CTEKlG7iCS: 

(A) LENGTH : 09^^ base pa i rs 
(}^) TYPE: nU(":lfMc: dc:icl 
;G) GTHAN[)K[)NES:^ : -ir.r::^ 
(P) Tr)r0E0GY; 1 inear" 



wo 98/16645 



9^ 



PCT/US97/18214 



c;(::'AACGGc;^; tc;tct(.-;g/i>^c; ccccrrcrsAT '?i\rc]AA(r?'^^h nrvrTrhGc;;;, 7c;c;[;a(;ta/\c; 17q 

CC";AAC:r,GCC AGATCTGGAC GGGCGTAATC GGCTGGC^CCo CGGCG/ViCGC AJCGGACGCC 780 

GGGCCCCCTC AGCGCTGGTT TGTGGTATGG CTCGGGACCG GCAAGAACCC GGTGGACAAG 84 0 

GGCGCGGCCA AGGCCCTCGC CGAATCGATG CGGCCTTTGii TCGCCCCGCC GCCGGCGCCC 900 

GGACCGGCTC CTGCAGA'XC CGCTCGGCGi ; CGGGCGGCGC; CCGGc-lGAAGT CGCyCCTAQlC 960 

CtX'^ACGACAG GC:ACArc:GCA G'';::;GAGCrTA GGGGCX'TGA ^^^9 

(J.) 1 N !■ O KM a:' 1 C ■) N I-' ) H S K O ID r J O : 'j r 

i i ) S v. 0 1 J E i^j C I. (. .. hhli / VL. "1 ■ ^ i\ .1 ; ) ■ 1 ■ 1 G o : 

(A) LENCrri!: 3^?. amino a(^:.d5 
(H) TYPl-:: amino acid 
[C] STKANUbJONKSS : nql o 
{[)) TOPOLOGY: liiK^^i- 



(>:;) GLQnKK'GK DKSGR 7 FT I or; : 500.) 10 ) : 'i : 

Mot ills iii^^ H:^^ iij;- ':[xs P^'V 0::- (i.; n Va] /u;r; To A^Wi Oou T\\r 

/'i o- 
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14b I'jO \'^'. ibo 

GJy Gill Pro Pro Pr o Val AIn Am: Asf) Tti: A: q IJe Vctl Leu GJ y Arg 
16 5 1 7 0 ] 7 5 

Leu Asp Gin Lys Leu Tyr Ala Scr Ala Glu Ala Thr Asp Ser Lys Ala 
180 18b 190 

A1..1 AJ a Arq Lon G!y Sor Asf) M(M Gly Glu P\\c Tyr Met. Pro Tyr Pro 

ZOO 20d 

Gly 7hi Aro l:o A:;n iMii G 1 ;j Tor Va ; Soi Loo Asi) Aid A:-;n . y Vai 
210 2V.> 220 

Sor Gly Sor Ala Sor Tyr Tyr G.l u Val Lyo Pho Sor A.sp Pj. o Sv,i Lys 
22'- 2^2' ZA'., 240 

Fo"' > hem Gly Gin lie i'rp '12:. Go/ V-. 1 :io GJ y Sor Pro Ala Ala Asn 
2 4'j :"a) 2G'j 

Ala Pro Asi.) A 1 ri Gly P: o Pi o Gla Aicj 'JG o Ptit: Val Val Tip Lou 1 y 

2*,;.) 2 CP 27 0 

'2\\i Ala Aso Asr: Pt'o Val A^o^^ Ly,<^ Gly A...a Al ri Lyo /\] a Lc-^u Ala 1 u 
27 ^ 2fGl 2 8') 

Po: lie A:tj Pro [.ou Val Ala Pre- Pr^a p7'- Ala Pro Ala [^ro Al.i Pi o 
2 90 20 S 300 

A. a iG a Pro A . <i Pro Ala L'r. • A. a Pr' > A G"i Giy Glo Va ; Ai ri :o o Th r 
■.G)P :n!i il', 3;0) 

Pr- Thr: Th; Pr:. T!o: 10^ G: o Ara Thj P>-a Pro Aw: 



T- oMFG 0- 

PPlG-l 



TFP :G71 
■.V 1 a 
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r NI''()RKAT:0N kc^r skl^ ii^ no:-.':: 
( 1 ) E::o [ ] N c: k c h a r A( : r f, r i s t i c : ; : 

(A) LF:NGTH: 1!) amiriO acids 

(13) TYPL: ama no acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(X)) ^JHNCK [)F:r;rRj PTU)K : SKf; T], N;0:Sb: 

A In Vd ] Gil] Gly M^,^i L.'u Aid P.-ii C; ! y Thr Pro Ala ?yo Ser 



: NFORMATK^r; rr^R spc) in ^)(>: 
\ 1 : SKQUi'Nr:]-: gharactkri ;prTG;; : 

{A) Lr:N:jTH; 1^ ar.ii:- aai;;- 

[EM TYPF: an mo ac^id 

(CG GTRAr;i)P:nNPSS : 

(P) rGP( )l,C)GY : linea!" 



GGi A^ 1 Gva I"*--.^ /;:a G G/ - G y Pi.; i;r:;j ',11; a;,i i^y.; 



; r:roprGvrp ;i: 
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I.KNGTli: 14 :iTA::.^ 
(^) TYPE: ciriiiicj dcid 

(C) STRANDEDNESS: 

(D) TOPOLCiGY: linear 



A-p I J 1 y Si I! S-T rhr Al ;i /»st, 

i NFOKMATIOrJ [■\)]< Sl-f/ I ;> h : 
( A^ AKNCTH : ; ^ ami i^;^ 

A. ^ : w'ra^I'AD'ik:^:.; : 

(i- TOFOLOAV: 1 :n-ar 



wo 98/16645 



PCT/US97/18214 



97 



([")] TOFOLOCY: linear 



[■?;'■:!: y[ 



wo 98/16645 



PCT/US97/18214 



(B) TVPK: aniiri(> acicJ 
{[)) TOPOLOGY: ijnear 



V:i i Ph< ■ sly h i Pro 



;} ■ 1 : P r 



.] .i Gin 
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Asp (.;] v.. I - r 



v.i: t'f. 

1 0 



Ar:) AAi Asp Pno Pf^u 
1 



AAi Aly A:a i^r:^ A;;f^ S-: A::p ll- P;;- P-.: .A-:. A-:- V.i: ^P:i : 

^ "-^ PC 

A:-; Ar'^i iii;; A:.. Alii A:'i ; A i ; Asn /^sr; :a u FP^^ Asr; Va] 

A' P '-''^ ! ^--y ■'■^ P''- /-.-I. Ay ''I'- T'yi : ^1 (A 'J !A:j p^t ] 

■ : 1 1 - 
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■I H:s A:-.-] Arq 



u.lri Arq Asp Aid ixni Cy.s L^-'U :ier Thr Gin lie ^ier Arq G]n Ser 

3 5 4 0 45 ^ 

Asn Leu Pro I^rc- Ala Aia (; : y ^ay ALi A Asn Ty; Aor Arq Arg Asn 
^0 S', 



Vrli Vf 



'I'M /vr q Asp l,y- A, 



1 
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T}; : A Ki A 1 < 



Ala I.ie Pro IJo Giy Gin Ala Met AJ a I]e Ala Giy Gin J Xi^ Arg Ser 
2C 25 30 

cay Cly (;]y f^-o Thr V.i.1 Hi;; I ] r;jy P: o 'I'lir /i.l.i I^hio hoy ray 



Glv V,i.i \.:] A:t- 
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Gin Sei T[u Xcia i J A: (} Xaa X.ici y Pi o Phe Asf^ Asr; Arq Gly 

S'er Glu Arg Lys 

100 

i.'i :::F0HMA'nON r'OK id :■;(): o9: 

Gi t GKQGKNGK ^HARAC'GPGT :;g 1 r\; : 

; A ; ;.r;NG lii : . m ; , i ■ : n;: 

GG Tv;^!.;: 

('"G ^'TRA!' PIG^GrGGG ;^:^:TG' 
GG GGPGrOGv. ; ; 



r-^-^^ 'I'nr /Kr.:) A- 
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(A) LENGTFl: 34'^ aivAno acid.s 

(B) I'YPE: amjnc) acid 

(C) STRANDEDNESS: s.inqlo 



A:"q Al a C] y 

] :■ 
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1 04 



Aid a:., 



ill'-. L^'.u 



: 1 o Fi o 
2 .1 0 



Thr Arq G\v. Val Val f\rq Arg Val Va 1 Gly Sor Tip iii.s Gly GJ u Pr^ 

2^)0 255 



:-1et: r^ro Mo 



;;or Sor Ar^ Tip Thr Asn iJiu i! : - Thr Ala CW w Lou Pro 

:^^>r: vf,', 



Al -1 A. 



r: li : ■ : ; p rj 



a A,;, .,rq i,,>.. p,.,. 



■i!A;-iA'"Tr:;A,:T; 

a K ■ 1 : i i ■ ; i 
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Thr c;iy Glu yu-.t Ar^f) Ai r; :;rr Alri A:\j Cys Va I Ala Ala Ala GJ i; 

:o[, 

Asp Gin Tyr C 1 u Pro Gly ^^cr Sor Arq Arq Trp Ala Gii; Arq Phc Ala 
-^1'' 120 



Iam: Ar A:;n i;Oi: : a Pho 



fa c> A;ai G. w !a c Thr" L-a Mr 




. a 
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Ala Arq Ala Thr Ar q Ly.s Ilo Gly 
405 

Leu Leu Ala Ala Ia-m Gly r;^> l'r^.> 



L: i ■ I'll' ' e CW a L-lj Gi y 1 A 1 

'10(1 

Leu Gly Va.l Met Gly Leu Ala Giu 
410 

Lyr Asp Hit: (Mu Giu Alu \Ai i Arq 
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^'V- i C t y Ay,y. 



1 1 :) 



■r Jt.i Aia V.j 1 Ly:> C; ; y Thr A:ni AJ a Pr. 

1^-0 125 



Gin Tyr Tyr Va 1 Gly Ab-p Gin Pro Lys Phe Thr Met VaJ Val Thr Asn 



: 4 ^■ 



. a r\ : a v a i ia.a: A I rj 
i Oa 



. -1 ' 



A:;r: iaa: A:^[.. 
^ / ^; 

i'^a Gly Gla 
] 'i- 

A 1 .1 pj ,^ /ii f| 



Afa Pr 



•fA':P P 
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Va.l Arq Ser :,y3 Ala I'rc Gly Ala Tnr Va i Ala Tr:r l^hc' Gin Asp 

70 



Pro Sor c:y Gly Sor Arq Thr Va 1 Gin Vh.L Tnr J.p:, g] y Lyo /^i^:^ f ^ 
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1 7 0 



.■r-r i^ro 
"1 7') 



3er Val I U> Pho Arq .Scr Asp Lyr. t]cr Gly Thr :^^ei- Asp Asn Phe Gin 
180 ISb 190 



Lyr> Tyr Asp 

1 OS 



f^P -'^y lA's ':Uy Ala 5-;cr 



.M 7 



y Ai ,i :''-■] y l y Asn A:;;i 



■ I : 1 ■- 



7 4 7 



1 - ' : .1 (Ay Ai 



n 71 y A:-ii A:^p I.^m.i Va ! \,o.u 



:p :hr- S-r 



wo 98/16645 

"j I , 

G 1 n Thi li.i ;5 G \ Asp At q 

20 

Vai Val Arg Gin Asp Arq 

u Arq Ar'q li i t'lfv) Aia 
SO 

'Ay A i y A: q Ai ■ ; A; .j 

Arc; A:>p Arn ^ 

1 

■ ' ! :i !: 1 ; ; A r ■] ■ ■ 



] 10 



'.A I I A : rn 



:i iii:. Aly Ay;] Arq 



Ala .Ser Vai '3er Ala Thr 
' ' w ; y :1 i :i A ni Ara Ai q 



1 ■ ■ fA ■■. A : - VAi : 



1 " 



PCT/US97/18214 

- ■ i , u Asp 

Arcj Ala AJa Val 
30 

Ser Ala Arg Pro 

■vA.i.i Ala lA-o Sf*r 
A.A' Asp A: a 
- I CM a s 1 Pr^, 



Ar ■■] ; y Va 1 
0. •/ A A:q 



wo 98/16645 PCT/US97/18214 



(2) INFORMATION FOF^ SKO NO; 76: 

( 1 ) sFoup.NCF character: STJC^^ : 

(A) LFNCTJl: [,i]C) amirio .^rias^ 

;C; ST:xANr'I-:[^NrC;S : :i:i:(jir' 
JC lOJ C:,OCY; i mci: 



■ : 1 . ; r ' 
^0 



A 1.1 A. A 



wo 98/166^5 



PCT/US97/I8214 



' Yr.! 



V;i: Arg CA:\ A:-; iTr Ar.; /\ 1 ■ j :A y Mr^i I.-u Ala IJe lAy A 1 Va : Thr 

-10 ;m:; 220 

T]{.^ Ala Val Va 1 Scr A: a Gly lie CI y G.ly Ala AJ a Ala Scr Leu Val 

2 3b 2^0 

Ciy Pli^^ A:in Ara / J Pro /il.* Pro A^m (]] y C, i y I'ro Va I Ala Aia 

PPO v^-a, 



a: :-\:--' !P 



2 '/ 0 



A1-: AA-i AAi 



wo 98/16645 



PCT/US97/18214 



V.i 1 Va 1 T!:: Af;:; /^--f. : y;^ A::f' 
bQO 

Vdl Val Ala Giy Gly Aid Ala Ala 

bib i>2U 

V a L Val T hi L y .s Wil /\ .s f ; A p /\ r f j 



Asrj Ala Gly V<il Pro Lys Gly Val 

: j 2 5 

Pr*^; Me A.sr- 5H>r A ' - 1 Asp AJ a Pen 
b 1 0 



wo 98/16645 



rCT/i;S97/18214 



] 14 



:>itn G]y Soy lie Sor G\u ].c.:i :;.'r Thr 5^^r Am; Va 1 ].ir.: /\r,p Vio Ai a 

130 1.^0 

Ala Gly Val Thr CJn Lou Lou .Scr Cly V,] 1 Thr Ann Lou Gin Ala Gin 

^'^0 ibb 

'*y ^^^^^ ^i'" c;;-; .^^r TLr 1 h : i,v:- : ' "hr 'Wv^ r 

L ■ . ' j u ' > 



wo 98/16645 PCT/US97/18214 

lis 

: 1 ) ::iAMIF^Nc^ ri;AJvArTi-;R!;;Tn':;: 

{[)) TOPOI.onY: iHioar 



I 1 o 



I J Ai< 



wo 98/16645 



PCT/US97/18214 



Aid Cly Ala Thr Ar.p i I A:^ri Ala ['\i>:'. f^oi V.j ! C; 1 y Ser 

100 10b no 



My Gin 



Thr Tyr Gly Va 1 Asp Va 1 Val Gly Tyr A.^jj) Arq Ttir r; ] n A.sf) Val Ala 
il- :20 



wo 98/16645 



PCT/US97/182J4 



117 



(i^) TYPK: ciruriu dcid 

(C) STr<ANDKDNE33; .smgJe 

(P) TOPOLOGY: linear 



wo 98/16645 



PCT/US97/18214 



IIS 



(D) T0P0L0C;Y; 1 1 near 



^:-f- .'cm ■]■!: : :i Al.i a:.: J^.L:y ^^ V A.;i Ami Aiy Pnc> V>m 

M. ■ j 

My /^Ai Mir /Mt; .Ay Am; An: A:tr ']:m Ai; rA y ]..M.i CA n 



.A . < i 



>1 /A:. A\M 



6^ 



/Mi . M : 1 



M ; < ^ . M ; 1 : ■ I 



wo 98/16645 



PCT/US97/18214 



2'\'.> :V>0 2S^) 

Arq Va.l Asu lie Asp Pro F^ie (Uy Ala Gly Arq GJ y Pro Pro Ala Glr: 
?60 :^6b 2/0 



1 i) H 



wo 98/16645 PC T/t'S97/I8214 

1:0 



I ^RMA'!' i 0:: yi'R : ^ ■ HJ : : 

U) SKOUf^NCE CtlAf</^CTF:RISTTCS: 

(A) LENGTH: ]() / amino acid.s 

[H) TYPE: an^irio acicj 

((') STKANr^F[)NK;:^^ : .■::n:i:.- 



wo 98/16645 PCT/US97/18214 



Veil hy;^ (Uu Pr o Lc\i Xaa Thr Giy lie l.y^ AJ ft I 1 tj Anf^ Ala Met Thr 
bO bO 

i':- 



wo 98/16645 



PCT/US97/18214 



12? 



1 : ^ 

( 1 ) sr.o^'ENCK c:haractf::ri.^tics : 

{A) hKNGTH: 103 amiiuj a;:ici.s- 



wo 98/16645 



PCT/US97/18214 



..1 Asf. >;m. 



U 1 1 



S-^r Mo^ Ala Ala I,y:; f r r: Ar q Tnr r; : y Asp Aiy Pro Lou niu A] d 

Thr \.yr> Glu (U y Arq GI y Mo Va 1 Me^ Arq Va 1 Pro Leu Glu GJ y GJ y 

SO 6U 

G ; \' .Ar.: J ^'1. '.G-: !■■. Irv. As;^ :A i: AAi Aia ^^.ia I.t-u Gl y 

- HO 



■ r ; 



.A ■ l.iGJGG:' : ' v- i^. i :.: 
rvia-;; .-in:::.:- aoici 



wo 98/16645 PCT/US97/182I4 

1?4 



Mot. Thr Gin Sci Cln Thi Vci ! Thr Va 1 Asp Gin ''i.l i 



I U' Leu /\sn 
1 



wo 98/16645 rCT/US97/18214 



(A) NGTH: /u'i <KTii no K-^cis 

(H) TYPK: ammu acid 

{C) HTRANI^EDNKSS: ^ririqlt' 

( D) TOPOLOGY ; 1 ; noar 



wo 98/16645 



PCT/US97/18214 



24 0 



Sci Val Arq Tyr Gly Ili.s Arq hsp 
24 5 



Giy G.ly Lys Tyr A 1 Xrui Ger Gly 
250 25S 



A r rj A r :j A>; m G 1 y (. ; : y tG < - Air! 
2f)- 



:g-;ng': ii 



-t' -1:1 Gi 



wo 98/16645 



PCT/US97/I8214 



127 



A.:: )■:■.' i'vr 1 y VvJ Ci y A 



■vr i;.y niK Ty: f;:y t^i:: '[yr (r.y V.i 1 Pr.) C.-.y r; j y -yr Tyr Giy 

IBU ie!j 190 

Gin Gin Gly Ala G.ln Gin Ala AJ a Gly Lcn Gin Ser Pro Gly Pro Gin 

;,oo ;^o;> 

• 'r^r. t;:y Tyr '^ly G-- Gin Ty: Giy ,\]y Tyr Gnr 



iT f GK G-:K : \) fPG ^1 : : 



■ii: G;; G": Hia 
/ ;G") 



G IKGG; 



w o 98/16645 PCT/IJS97/18214 



(?) JNrOF<MA7'ION VOR SEQ ID NO; OS: 

(a) SKCJUENCi: CHARACTKRI^H'ICS: 

(A) LP:^:c;T^:: l .irrunr^ .>':ifiy 



; ! V Pr 



wo 98/16645 



PCT/ITS97/18214 



(P) TOPOLOGY: 1 incai 



. ;■: ■TO';'! : i ' 



wo 98/16645 PCT/US97;i8214 

130 



;;ln Tyr ih' .,.v W^: Cli: l>'i \\j 1 Aid G.y Sei Cy^; A;;r; Asii I'yr 

.H', 90 9!) 



(2) INt'ORMAl'lOr; r-'OK Sr:Q TD N0:98: 

(Ai .,K:CA7t:; .S^; pa::. 



wo 98/16645 PCT/tJS97/18214 



ID) 7OPUL0>;y : ] : lirM i 




wo 98/16645 PCT/IJS97/18214 



TGATCGccuA GAA:A::n'G(^'; g/vActc;atuA t7C7gai'a:^c; GACCAAc:t:'j'i:' ':";'gc;l;c;ca7va 

ACACGCCGUC GATi:GGGGTG iV^CGAGGCGG AA'J'AGGf-iGG/^ c;A'rG'l'GGG(:G t;AAGAC(;CCG 
GGC;';'r;ATGTT TGGCTAGGr.- " r;(;(u^(:r.;Ar;:- r ( ;aG(v- n \ JAG 'IGGGACG'rTG ^■TGG'-r;7TGG 



'/GG 
7 HO 
8^10 



AAGTTGt; 
AA:"-GAG( , 

■ :aaa:;a^ ; 

. : '/A. ■■' ■ '[■■ \ 



1 -iGO 
1 .1 (.t 



wo 98/16645 PCT/US97/18214 



I ic: -...i.cA/v.'A yu;A'; ' K^'.-' ;':a ^a' ;aa/-;A: ■ t; .av( -a ^ av.A' 't* ,a: ■ ■A'.a:'aaaa, ;AHi! 

CJCCAAGGCCA (>:A^ACGTGS7 ArAA; ^Ai iTGA AG rTCCiTGGC G7(jA7CC'::'G GGGTGv^CAGT 7 4 0 0 

GTAAGTGGTC AGTGCTGGGG TGTTGGTGGT TTGCTGCTTG GCGGGTTCI"! GGGTGCTGGT 24 60 

c.:ag'1(;ctgc:t ggg( A:'rc(;c;(; '.'gagl;agatg gag(^ca:cagg TAc.c:Gcx:G'rG cj iCGATCCA 2bzo 

■I'TlAArCGTGA TG7TCGAAAA ■ :GA(^( :';GArA C;A': 'GAGGC AA: A7\;ATCGAGG G:a 'G AAGGG; ■; ;'SHi.; 

-AAGATGA^A' AGAA:AV:-AA 71-^ U ;;a;'I C; AAA-TATAGC: T?r;A(AA:GrT ^;^-rAGGGGi'T 2b40 

' A , .;AG: "A,; IW'Y'WW.r.,'.' , .A/iAA':;,^ AA ; , A. • ; A/^AuA A AA A\AAAf.AA. A^ AvG A'JAAjG':' lA/'IiJ 
I - ^' A-A'rr- A' "GA "7TA 'T ■ A ir'AAG'^A^ ' A - ;gA ■ii i AC A A';';A( ;AG A^";T''AiAG^7>r 

G'A;A7AA7AT 7(7;^.:. V.A-AA ■ T.7A77 A ' : A^'^ ■ ; ^77. ^ 7^^GTAGA7G^ : A--77-AGCAA 7>A.~0 



wo 98/16645 



PCT/US97/18214 



r^4 



H'^ 'n'.' 

Aid 'I'yr CUu Thi Aid Tyr Giy Leu Thi Veil Pro I'ro Pro Va I ilo Ala 

100 105 110 

G]u A.sn Arn a:.i Giu lyw Ket IJf Lf'-j : \ r- hi n Th r Asr: I.r-i Lev: Gly 

170 

■.Pn 'I'm i : /^ /i.G; 1] \- /\\ a '.'..i.. A.-i. GTi A , -i C- \ \i '.y: '.G y Gl u Mr." 

Trti Al.i (A:, A^ ; ■ /Am Ai i AG; '!-■■* iA-' Ay Gy: A Aj AAi AA; Th:. A Ai 

AA) : 'r. 1 f.C 

T:,:- a: j T:.: A : ■ i A.: l.-'; A>: ii J':..' A': A,.i ) :Ai X-n. Tt- 



][■() ;9:> 

A:-:p Tlu. Ai-i Ala /A <i Ar:i i,'-! i-!- ■ ' A;.;n m:;:: V<A ?A ^ ^ Gin Ala Lea 

1 ^A .V ,A 



wo 98/16645 



PCT/US97/18214 



135 



: ! >■ -^y '.'^1. ;-r : v.: /ua I'r 7yr 

Pr !,) !i 1 :i r : 1 :j Ala Ala (;i y 
38!; 390 

(2) INFORMATION FOR SKQ ID NO: 103; 

(A) I.KNGTii : ; 72 j O.isf ; : 



wo 98/16645 PCT/i;S97/182M 

1 V) 



GGCCC'rC[]CC CTCGACGAGC GCATCCACAC CGCCCCGCCG AACCAGTTGA TGAAGAATGT 
C^Gr'GCAAGCC; GTG('^AAGAi >r Tf ;( ^GCGAGGr' A^^GC^GACiGG':" GTGGTAGGTT i^'IG'v T^AACjCJ' 
(;GG .'GG'-Ci'vi 'l\}Gr.r{'"rp.y] GGT^ i'/G-^.-/: VGTG'iGGGGG GTGAGGAAGG G( YvC/rTGC^A'l' 

■■'G g-.a:, g.;:"gta ' g' ■ : ' r. 7, , .;AAA: G^f :G( ^ ' 'G; :aa/iA("'(;;.; 



i 260 



:GG.GG AGCGlfG'. - G.-^' / 



wo 98/16645 PCmTS97/18214 



■li. Ash Ai T:ii C> Ui l.a: y.'^l i'!.; TLi A:;:: i.e.: I.^ni C.l y 

]1S i;a; 

Gin Asn Thr Pro Ai a lio G 1 u Ala Asm C; 1 n Ala AJ a Tyr Sf?r GJn Mot. 

130 13'; 1^0 

'^'m^ G;y liiii /^;;f- Ci;. Ai.; M'.'t 'i \' : \' : v Tyj Air; A ; a 7!.: /A . i /^la 



;ia 1- 



wo 98/16645 PCT/US97/18214 



(XI) :;Kcnjp:NC}-: i-Kr^r-Ri pt:i on : r;K-j ii^ nom.o^: 




wo 98/16645 PCT/IIS97/182I4 



/\f •t.^Mv:;v 'v ' '.v^a'. r:;; ,i ■ ■ ; /v f^. ;c.'f 'a ■ >; :a': ; av: ^t^ ,a ' ' "ta 'A/^A^'iAA' a; ]'^;;) 

AC; AiAAATAA ACAaA'I A A AA AAA'I'TC:A(.'lA' AAA? ATAGC ;T Ac;aC(;TAAAA CArrCACCAAG 1 MA) 

TCT.";ACGGAG ACCCAAlC-rrGS CTGCTCGTGC c;AGATAt:CGG CCGTTCTCCG GACAATGACG 1560 

GGCj A.;ACCTG AAAGAGAAAI' CAAAA/GATGA ':TAATCt;GAA GC^GTTATTTA AGATTAGTTG 1620 

GCAGAG I'AT'!' T A.'A'TGAT . ^'f 'rGAAAATGAA . ' AG' 'T AGAT'l T AGA^^A ■ I*-'' ; G'/AGGC;CGGC ] ofAJ 

■'"G;";g:;:a ; vr;. ; A ';A' \'.A;A' ' A,, / , AAAAA. 'A. /GGGT'IGGC^r' iRtv; 



" - ■ : 9;A.; 

" ':vaaa;g(:a :'^^u^ 

'■(iGr-GTAAGG ;::00 

7""^'^AG/G[ AAA ,/ 1 t)C! 

/■.A'A^ ::aacuaa r 2 n 

■ ■ 'A ■■'■T^-A'-. / A1 ■ ■ 



wo 98/16645 



PCT/IJS97/18214 



140 



(A) Lt^NGTH: 396 amino acid^? 

(H) TYPE: amino acid 

(C) srRM:DKiv.<iKr,s : 

{])) 'rOPOI.O^;Y: 1 1 



wo 98/16645 



PCT/TIS97/18214 



141 



.i^' 

y'Ai: Tr[> l.yr, Ala 1 iW^v Pro H:s L-u :.^v.-r- i':,' A;;ri 

lie Veil .Ser Met. Lou A.sn Asn Va 1 ^U^r Me* T^ir A;-r; .Se: ^ily Val 

S*w r-:--! /v ; H m ].-;; IJ i s r-;-^ I.-^:: :/;/;; 'ily i::,- A.! a \-io Al.i 




-1 



wo 98/16645 PCT/US97/18214 

142 



h(:j':cTG(^iv (^(^Ai v^ ^ -A. 'AA ■■;!;. ' A 'r'"AAAA' ";■ A':aaa(;':a*'. ;a 'i'^ y.yi^y vrc^Tc, 

CA-'"ACAAA(^A TAArrCAAAA;- AAfir-^AAAAA -AATAAAATC; AAr;ACC;AC\ ;A C( A:A/A;CC(^C 3ul; 

G(;CATACACC CAGGCCA'VGG CCAGGACGGC GTCGCTGCCG GAGATCGCCG CCAA'^CACAT 360 

CACGGAGGGC GTCCTTACGG CCAi'CAACrT CTTCGGTAl^G AACA':GATCG i.^GATCiGCGTT ^.^0 

GAi T'CAGATr. fiA'l'TAl'T i'f'A f ;T AT^T> ; ^ ^A/\f 'VlfW "A ■ ;^ "f'T^TGf ;r:AA ':t;C;AC;( ri'f^^'A 

■.'.a:^^-'--' ; T ■'■:'GA 'A- .= : ■ : :-'rA'--:-A''; A( '.g ■ ;ggg:^ ■ /.AG'rr.f ;gtc;a (.bO . 

' 'G/. A'AACf ■ GAAAC'/f;:, a; a: \ :A'/aa. ■;■ ■ A' -a^ ■ ■ .aaa; ■GA^■' ; ■ A-.;AAf ^ aaggt "g^' 



GGAGGCA 



^::A';<:GA7'i;C" :;^""A:" 



wo 98/16645 PCT/US97/18214 



(>::) SLQUKNCK Dl'l.SCEU TT 1 ON : ^]EQ ID NO: 108: 

■TtxiATi; (;c;/vcoA'rc;(;(: o/vrrr'i'f 'Kir a( ;":otoact( ; (yrTiC^'rcrcv 



-j,C AT 'TT'Yi.; tA) 



wo 98/16645 PCT/US97/182I4 

144 



i:)'- wo 

H.i y. IJe Thi uJn Ala VaJ Leu Thi Aia Thr A-iii Phe Plu^ c;iy IJe A.sn 
lib 120 12r^ 

i'hi- lit,' Pr^~- ilf Ala \.o\: Thr G.ii: M-?; /LSfi I'yr Phe J 1 f Arq Mf'l Trf.) 

l.'^O 13': ].]^^ 

i\r-i: C/. \: Ai-: A ! P-:: AAi f-P- ' PP. '.P; 1 y : P!r. Aim P ! : ; PPi A.: ] V;i I 

MP IP^] I'P: lP;i 

T:: r P >; : y.; i.- i ... - iP ■ A I .i P-r : : ■: P.-:: A:^;p Pr 



P.y P.y .P u :P ■ t 

PIP 



wo 98/16M5 



TOPOLOGY: iiiu*a: 



PCT/tlS97/I8214 



14: 



;j K.-^t -f:-- ;■ 



11-: 



1 r o ; I . 



w o 98/16645 PCT/US97/18214 

146 

Ar'AC ;TATr: ^ AiiAAArA'^ n ; 
CTTGACGAGG G(;AAC;C:AGT 
C') INFOimATION FOH . 



A/\i i! 'G GA A'!'' " ' 'Ai T. iAAAG- i "A- ^ V^'i I 'AGG ''''"T:^ J(>'i 
G GGTGAGCA.AC; GTCGCA 3 96 

SEQ II) NO: 112: 
^RA'/TGRl :^TJ r^S : 



wo 98/16W5 PCT/US97/18214 

14-7 

'jTcxx'Tcsrc AA: ;3AoC;.;aa T.;AATL;GAro iv;a::'a'J':'tc(" (■':c;wAT'r;-/:; r'Trr irTijCGc; -sou 

CCTCGATACG CGOGAAATTC CAC^TGGTGGT GTGTCATGTT T'rTGCTCCGT TTCTVTTCGT 3 60 

ATTAGCGGGT CAC^AAGCCGA TTTGCGA .^8 7 

; ; : NKGi<M/\';G u': ^-'^m y\y.o 1 rjo: : : -i • 
; ^ GKow:G-;rr: ■'\\r^]'^S\C'yKii:.:'.T: '-^i; : 

^G ST!- AGf^fG^GiGGG :;:r-jj- 



GiG;<.'K:F'T"I(>N; 



G •■■:gG'' a.;aG': 
A'-'i 'rG:"^TA^r 



wo 98/16645 PCT/US97/18214 

1 IS 



(C) 5rrf{/aJl>K[)NFS:i : 
[f)i TOrOLOGY; l^nea: 



^J.i V.ii ..:.v \. 



wo 98/16645 



PCT/US97/18214 



149 



f"^ ;;'rKAr::':-::.)rH':ss : 



wo 98/16645 



PCI7US97/18214 



V 71:: A^;p 71:: Gl 



wo 98/16645 PCT/US97/18214 

151 



.Mr 



wo 98/16645 



PCT/US97/18214 



]5: 



(A) LKN:;TH: 4 ajiuno 

(B) TYPE: arr;ino dcid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: Linear 



^A; Ll'N-TIi: T) 

( P ; :"'P : ; : lo ■ : ^ ; 

PTi-A:j:'?;p^NKSS: 



wo 98/16645 



PCT/US97/18214 



{[)) rorOLCoY : l.Lnoay 



(Ki) r^riOHENCF: DKSCRI PTU^r^ : SK<Y i.i' N^'^:13]: 

\To Gi\i Sor <nv Aid lily l.'^;.: :;ly (My T]i: Ya 1 



A Ml Gly 

1 ^ 



: GFOHMAT :0N PV)R :^?:c: I V ): 1 'M' 



I,Kr:GTri: ri in::;') .i. 



wo 98/16645 PCT/US97/18214 

1S4 



■ ;-./"''"':;:;':rrA^' rTC'A. "; ■ ;a -'■:'C'r:'/y':rr ~]:'C7']-':r,rr:] :J^;,^^:-]cc(](^ 

ccc;t::-;!.:ccuc; rf.;TTf";r:;(::'T '.;ccccc':]1'Gc iiAh'TCcccGr ccc^Arcxrc attcuCccgt 360 

TCCCGGGTTG GCAGCCTG3A ATGCCGA:CA TCCCCACCGC ACCGCCGAGG ACGCCGGTGA 42 0 

CCACGTCGGC (^ACGACGCCC; CCGACCACGG CGCCGACCAC: GC:(^G^:;rGACC AGGCCGGCAA fl n 

GGAGGCCAi^'G :-;/UX:A(A.;' 'GG GIGAGGAGGC; ■ '( V\A' 'GAG i';G;;G'""GGA:"/"' AG(';r;CGGl'C;A SmO 

crhC^.r^-,-: hhr::r,fcrv::: GG' a -r;GAG- -A ■Gr^yyr:-^-' r;A: G;AGG-"'I ■ ' G:":^^' X^AG^'A GOO 

0:"0 i'G'O.'O;. 'G GAGG7\G(GG\"^ 0:G:0GAGGGA ^ 'G' ■■7'.' '('■'■ ■ -'-AGGACiGGGG G':"rGGG(^A(OG hhO 



G^:' 7^1 GGOG':' : 1 .: r 



O-G ; r I '[G 



wo 98/16645 



PCT/US97/18214 



1>> 

:'cc:cc ::gaag gaggccctga AC-TC'iciGcm c;agggga7'Gv. [;ggatgggtt Gc;GGCAC7rGG 66o 

CCAGGCCAAT ACGGGGATAG CGGGTGTG^NA AGCCGCCGCG AGCGCAGCTT CGGTTGCGCG 72 0 

ACNGTGGTCG (".GGTGGGGTG TTACGGGCrrT GTCNTCGAAC ACGAGTAGC:A GG'l'GTGCTCC 7H0 

GGCGAt:;GGGA TG: V^fv- m^Tr;:"-^ Tv ^ A' : GTGr;7 H i 



wo 98/16645 



156 



PCT/US97/18214 



' ^ '^^^viL - 1 ,:. I; /, / . ; ■ I A' \ .7Ayv; i \ "A' "T^A ; AAA< AX. Aa XA ' ^hO 

cgc:aa;cca;a ataccaacag atag(;a::"aaa gaattc;ccga ccagccgatt gga(;ccgtc^c :o;^n 

ATA::C']CCGG CACACTCACC GGCAGCGAAC AGGCCTGGCA CCGTGGCGGC GCCGGTGTCC 1080 

GGGTCTACTT CCiAGAGGGGC GATGACGTAC^ '{GAAAGCiTCG GCGCGACTrG CATTGCCTCiG 1140 

GAI'CGGGAGG AG ^ ; 5^; 
(/) : MAJHMAT"! GN W- n< AP:'.j ! il'':]'<h: 

(A) Ai'N'r^AH : tA- S i > ist' [.■•:( s 



wo 98/16645 PCT/US97/18214 

IS? 



(XI) SKOUKNCK DESCRIPTION; SEQ ID NO: 137: 

Av>n Ala Veil Vai Ala Tho Ala Va 1 1 ,1 c Gly Pho Ala :;or i.^-a Aia Val 
1 :() ] !j 

A:a VaJ AUi V^: li.: : • A:a la;: ^hr A i a Al.i y>r.i :,y:, t'f:* vat : Clu 

a) 



Tar I 1 a laa^ /'a i = . : ■ ;i a Aar: a.' ■.■m: ! a- a Ara 
■/a '^a 8r: 



■i":a P:a ^ A^a :''a :a fa AJ a P^^o 
■la 



■ 1 ■/ M' t 



;'ra ! 1 



wo 98/16645 



1S8 



PCTaJS97/I8214 



(2) INFORMATION FOK SI-:Q ID N0:i::i8: 

(i) SEQUENCE CHAFIACTEHI ST I CS : 

(A) I.KNGTIl: 17-1 ammo .icidr, 
(H) TYl^K; annno acid 
(C) STr:^:s![)f.:nNES': : 'vi. nglf 
TOVOlAJiy:: :ir:e:i: 



'JU ;'T"i.:^:j: .:f;^ - i 



J 



:u) 

t'""^' i':^ '.:y '/rf-^ ; : A! ; a;,, : . v^-, x^^^,^ 

71.^ . a;.^ IA, li . I l-o M.- fr- [•^^i 



wo 98/16645 



PCT/US97/18214 



TYPr': irri:.:' ic : :i 

[':■] :rri<AtJi.iK[)NF-::;:; : i-mqli^ 

i T()rOT.OC;Y : 1 i nn.ir 
[ii ) MOLECULE TYPE: peptide 



;A; PP::GT];: i;-; u-wk-^ 



y Giy 
J 



Pi' Z'.:'] TP: TP: AAi 'M:: A:.p i': ^ A:; \ i -.} Ai q 



wo 98/16645 



PCT/US97/182I4 



160 



1 

: Nl-'OKMAl' J ON yOR :jK'J JI) NO: 1.41: 

{]) .SKQUKNCK CHAR/iCTKRISTICS: 

(A) LKNGTH: .S3 base pair^3 

(B) TYPE: nucioic acid 

( C ) G RA N [ 3 E D N t : S S : singli' 

(D) TOPOEUGV: linear 

Ml) MOJ.KCULK TYEE: ut.h'M aiiul-:- 

[A) [)E:K:HIPTK)r:: /ds-^r ■■ "T'^H primer" 

; V i ) OKIuTNAI, SOURCE: 



{:-::} .W'QUErJCE I .E^-rR I ! 'T T ON : D NO : ! ^' : : 

;v:'o:.;at7v': rw^Gi.'Cfy:rf,r oaocatcat." A'/i^o. :A'r:": :a ^'ryr"-v\ 



^I--0H?-:AT ION EOK NE'J : !) No: ] 

; : ; AEQNENf'^l On A><.7i''Tr;R 1 N7 1(0" : 
>.A; EENOni: -1.^ baf;."' f-aio 
(:0 OYFOO iiarlf.-i^- ..i(_Md 
(o ; NA'i^NI^Eny^E^A? : n'^ 1 
00 ToI^^lEOOY : i in-a:. 



:A. no: A'O : TO);): 



INA;, .Ak:K' 
OROAN ! EM 



wo 98/16645 PCT/US97/I8214 

16] 



c;gatcc7gca ggctcgaaac gaccgagcgg r 



wo 98/16645 



PCT/US97/18214 



(A) OHc;ANU;M: Mycobcicter 1 UMi t ubercul j 



;ag;\/v'['Tv" 'riViiWKAi^r^^: A'rT'rA^;;;A(;'; a 'a 

1 t J r->i^:'i AT I I )N i-'AH Al-X' 'i: TJ'^: i 4 : 

■ i ; i oi C 'j .* >; i : i : n. ■."M 
. V. ■■ . ;•. : AINAl, A< '')K(:K: 



TiiRi': 



w o 98/16645 PCT/US97/18214 



Asr; V,-. , I : 



y Th? 



■1 ; 



TCT GGT GCC CGC ATC GCG CAG GCG GCG GGG GC^G ACG C/IC AAG ATT C^GG 
Ser (;]y a Gly lie AUi Gin Ala Aia Ala Giy Thr Vai A;;n llv. Gly 
9[) 4') iOO 



460 



GGG 'iG'G CiAC' CJGG "[■■/[ GTG G'!G GAG -'G' (iAT AGG 'i^'G f;:''; ' 7\' " /iAG r:G; 
A!,i G^'; /^:;p Ala 1 \-; i.'-i; G"r GJ'; ' G v /\t\\^ NG-t A].i Al.i 1: : ,^ ]■/:■ C \' 



vAf A'! 



G' /■ : a 



Gi.^ I. 



wo 98/16645 PCT/US97/182H 



Gl:; Aid i. .1 e Se; : 1 /vsf..* -^y^y l-:.> Aia Pre. A:;p ] y 'Tyi iTi' 1 } v 

3U0 ^0'J 310 

ATC A/^C TAC GAC^ 7AC GCC ATC GTC AA(^ A/^C GGG AAG GAG GGG GGG ] ] 32 

IJe Asn 'i'yr Gi.'.j Tyr A] a J I v Vd 1 A.';:i A.sri Arg CM n J.y.s A.sp Ala Ala 

Tl^i G'f) ■iJG". 

a: ; Gl:: Gt:r G^': :.: AG; : /. G:^ AG. I G. G;:" A.;p Giy 



•TT 



wo 98/16M5 PCT/US97/18214 

1()5 

Vai Lys ilo Arq l.cu Hi:j Thr i,eu Leu Ala Va 1 Leu T\)i Aia Ala Pro 
1 b 10 15 

Lea Li'.u Leu Ala AJ a A. a ('Ay Cys r;]y So)" Ly^^ 1-': o I'rc Se r :;"!y Scr 



/J : ■ ■ i A 

1 ' ■ a S ■ ■ ' ; ■ 
] 1 a 

1 



wo 98/16M5 



PCT/US97/18214 



1 66 



Ala Tr:.: V:^-^ Aid A:^:; ^5 ! ri Ala Lie T'cr Met: IJe Asp GJ y ['ro 

2 90 2 9b .^0 0 

Ala Pro A.sp Gly I'yr Vro Jit- 1 lo Asn Tyr Glu Tyr Ala I Jo Va] Asn 

30b '^10 31 S 32 0 

A:,i! A:w Gin I.ys Arp A!:: Al ."s Ti.v Aia iG :i 'IGr L-m: Glr; AGj Phv G^-u 



..■/:: AG. ri.f/ l,..a Gap -Gri Va 1 



.1 -1 ^ 



: G i TV?r : :)a.:G ^a : . i 
■ ■ , ;'r[<GG!MG)!jraG : . 1 . 
T'';'al/)av. I ; 



wo 98/16645 



PCT/IIS97/18214 



lf>7 



TGACACC^TTC TTCrfTCACCO AGTAC'JTG'rc: CAAGGAAGAT Ci^CGAGGGGT GGGGCAAGTC 
GGCCCGCTTi;: GGCACCACCC^ TCGACTTCGC GGGGGTGCCC G( ;TGCG' :T(;r; GTGAGAACGG 
G/^vcx^G('-GGC ATGGTGArT'c, :]'rYGri;crGA (]Arf\c(r;(\(]c ''■(■'GnTf;(';Gr'[' ATAT(::r^GCA'j^ 

■ :AGGrTGGG ■ GAGv'A'."^':' " "A ■ /AA'.""' "GG^ ; A.^l "GGG;AG GG- 'GAAC 'T AG 'G^AA^'ACJ^^TG 



"/ G I ! 

7 8 0 

8 40 
90 0 

1 OGi; 



■ ■■iGG'VVr:' 
'TGAGGCGG' 



] ^ A 



wo 98/16645 PCT/US97/18214 

I6S 



(xj) SKQUF.NCE Dl!.::^CRl PT I ON : SP:0 ID NC):lbO: 

lYf/t Lys J "1 r- Arq J.^mj H.i <: Thr I-ou Lfu A i a V.'i I I,pn Al.i Al>i I'rc^ 



wo 98/16645 



PCT/US97/182I4 



169 



rh<: Lo;] L.eu l-^ro Ast* A i cim :;cr- li*; Gin Ala Ala Ala (]]y rii 

27b 280 ZS'j 

Alci Sor Lys Thr Pro Ala Ar^n Cln Ala lie :-;er Met .1 J o Ar.p n]y Pr- 

2 90 V9b :U)() 



A ; ^■'r:^ A:;f^ 1 y Tyr 



il A.s:: 



^ 1 



■Nwril; ;77/ \ 
I'M,: rru lf'i - 

■■ >• '. "Wi 'I J ;A ! y : 



wo 98/16645 PCT/US97/182I4 

170 



TGOAATCTGh T::GG(]h'YGGG GGCCJCGGM^ AA(K; I'CV^Cr C Ch'rCGGCACC GGhZCGGTAT bGO 

AGCTATGGCG A(^CAACGAGA CTTTTTGTTG GCGCTCTGGG ATGCGCTCGA CGTCGGCGAG 720 

CACGTGGTAC Tr;(VrGG'rGGA Cr;AG7r;GG; ;G TGi^GCGGTGG 'TTGG ACH'G ( :Gf-TAArCAG 78tl 

CAT'JGGGAC'G GAGTi;GAi"(:v; GATC^GCG':":' AT: ^^AA.K'CvA T r\ JT^ 'ACG ^ " ' :;A':'CDAGG'r(^G : i 

"GGAGi GG'" : ^ 7 ;; ' ; \ ;■ ;gg';'- ; T : ; i "g.-a^ ;GG';' r ':'G( \.;/\'rGG'^' ' 7'GAhGG<^'G_:jG,' ^vy: 



G'VGGTGGGh 



AGGAGGTG;'; ] ?/)■..] 

.A( >GC;Ai'GG; ' ] 



wo 98/16645 



171 



PCT/US97/18214 



cK:ccATGTrc: ?cc^::tgtcga ccttcc;cccc atc;cx:ccx;ac: [;ttc;ctaaa(' ccagggtttg 120 

at:::agtaatt gcc^gc^ggacg GT'r(;cc;GGAA ggcgi;ccac;c^ atgtgggtga gccgcggcgc .ibo 

•'"GGGG'"rT;rv' r-AGGCA:A'":^'r; Glf ;r;ATr:':-:\':: AG:''f'CC;GT'G ^G(;c:GACGT^ GGOAGCGTT'r ?Ar, 
: .:iG.)ULNr:-: 'iiARACGKR.: GT.IC;.: : 



1 



wo 98/16645 



17: 



PCT/US97/18214 



CGGTTAACCG GCCAJJAGGGT CTGGACGCCA ::(;aggatc"t gcacatcttk.^ ag(t;ccgagt 
CGGTGCGCGC CGACC:GGCTG GACTTTCGGC GCAAGCACAA ACTGGCCCTG T7GGTGGTCT 



CGGGGATGCG GGTGGGGGAG GGG(.'J'GGGCiC: TGGCGGGTA(; f-ICGCGA^ 

rf-A(;' -TrAji^:- a; -ag^^t^:-'" ■ (■T(7\^^ ;gc-aa ■.■g::ggcgg ; \;GG(/:rA 



.i;g7tgc:a 



i'l GviCT T 



I 020 
1 OHO 
1 M D 



( iG 'rvrp; : nvicG j i d 



wo 98/16645 



17} 



PCT/US97/18214 



G/i/vGAcccGG ccccla.:cata tcgatcck^ct cc;gc(;actac: tttcg:;cgaa cgt<:;cA':gcg 

GGGGCGTCGG GCTGA'j'GA'l'G AC:CGG'rGGC:'J' AC:GGC;GGCAA CCGCAGGGGA '!'( ^GCTGGTGG^ 

^:(;ttgg(x:tg rGAAr'r^\/:v- Af-Tr-'Grr-i ^Ar;GGrv-Arv; '.y:AG':^GcrGA atga^cagg^: 



wo 98/16645 PCTa)S97/18214 



Trp Arg f\r.:\ lie Met Pro liis Leu Glu Gly I.eu Gly Arq l.ou Val Ala 

130 135 140 

A3P l,'.--^ ]lo (.;]v* M'^t. vUy A1:j A>:t) I.ys :lot Pro Svm" (Mv 

I'l^ ]■'!' ::;:> j 

1'"-' V/^, 

i'-^ A:^f. APi P'.-i: A::i P--! -i^y i! i :j V.i : Va ! Au VP Leu li---: A:,p 



i 'rie' Ai {; :P'r 
'1 r; 



wo 98/16645 



PCTAUS97/I8214 



I7S 



Ai:'^ .lii -MS 

S'^r Gin Pr~o Leu Pro S(?r G.lri Arq Gly Arq Gin lie Ty]- VdJ Al c3 G] y 
420 -125 '130 

Cr. ri Ai'g S€^2^ .Sor Tyr I.c-ii i': d So: Gli: Leu V.i 1 A.la AK; i'ho heu Ti-p 
-IXS 



Gi'-y:;i'.M. 



wo 98/16645 



176 



PCT/US97/182I4 



Ala Thr Va.l i,pu A] a Aia Va ] Wis 
] 00 

Pti- Va: A.] a v.:a; :.>'-a 

n 'i la > 

Aa: > ;.va /aa^ v.i 1 '/ i 1 L' - ; : : i a A: a 
1 : - a 

, a TLr A^^a ^-1 : a ii ; a ',aa" ra ■ a i a 



; - >a? Trfj I" ! C.J Ar (j C, J n Ar'a G 1 y 

90 q', ' 

Glu 7rp Pro Pro JJe Vai Val His 

lOh 110 

Aaa Ara a: o C 1 y a; r. In s P; o Pha 



/\) a Ami ^ ^ ^ ; A: ! . ; Ar a 



wo 98/16645 



PCT/US97/182I4 



177 

/\ V'. I AAC A I^ I 'AT'. XiGV'. ■ : 'A'.'AA' T '--'r ■ ■ ,;A" 'i 'AT ■ ' AVA' ]T'' " AA (A: 

■.jv:aa7a;(:aaw AAA'i'CCiCAAo Tr'::.'T':"c: ;aa wTatt:;- "a. 'r- aA':^agata; 'a -A';aaaa(Jaa it;; 

ATCGCCGTCT CGGTGATCCA GCCGGCGCTC. ACCCAGAGAC CCXTGTTCJGG C/iACGTCGAC 18 0 

CCCGCCGACA TGCCCCCCCC GTTTCGGAGC CTCACGCCCIX TTCC:CGT7CA CAGGGTCGGG 24 0 

(TCAGGGCriTA;- T'TTAGGGTAA GriAA ;.^,,-] 




wo 98/16645 PCT/US97/18214 

^^;:AC::ACCy- TCr:G?T;^^';CA V-LA-(y]f\'rr r;;.-;c; ;\:Cr^A:^A ■ 'AATCTTrGT (A;C-CCCAGTA 1080 

ACCATCGGCG ACGGC:GCGTA TACC :;GGGCC GGCAGAGTGG TGCGGGAGGA TG7CCCGCCG 1 H 0 

Gf;GGG(^CTGf-; CAfrrGTGGc^- f";Gc;Ti :cGr:AA r Ij7: 

G' ' .1 rJKORMA': : ^^tJ :G -H ;..■!■: V ] [) N' G : f 'G 



wo 98/16645 PCT/US97/I82I4 

179 



SKQimUCY. CHARACTKHlSTiCS; 

(A) LENGTH; 14 3 9 bcise pairr. 

(B) TYPE: nuclei r acjd 
iO !;i'KANUt:[)NES.S : smqjo 
(13) TO PO LOGY: Linear 




wo 98/16645 



ISO 



PCT/US97/I8214 



aagaca:a(;gg c^;>:;taccc;c;a g(;aaaaa.:a"^: (X:aaca;ggg:; gcacc;ggg-g ■(;tgggac(:c3 

GCGGCGCCAG GGGCVVAGGGG GGCAACGGCA GGAGGGG'I'G( ' CGGCAGGGGG AC:AGGCGTCA 

TGAACrGTGAG CG''^C(^GGGAG GGGCGGA.AGG GCGGG/VATGG GGGCAACGGC GCJG.AAGGC^G 
(/A i Ni-'GKMAT.l ON GiA,) I N( A 1 r ^ : 

, ; ^ :''}-\n)\:U'-i: ■.■:i/.i'/u-';r:;GA;': i (a: : 

A; - :,Kr:. ;Ai; : A T..i: r - 

f 




] J20 
1380 
] ^» 3 



wo 98/16645 PCTaJS97/18214 

IS] 



( H J TYI'K : r;i]c I fie .ir i d 
(IJ) T()rOLOC.;Y: linear 




wo 98/16645 PCT/US97/18214 

1X2 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 690 baso pairs 

(B) TYPE: nucleic acid 

(C) S'J'RAN[)EI)NE:^S : r. i nq t 
f !)} TOLn)I.0(;V; 1 1 tK-Ji 



AT- :A'r^"Y': ;ac;aatcc; 



■;a^t;acctac 



;a'j^ 



wo 98/16645 PCT/i;S97/18214 



GC:C7Gi]GGG GP^CCGGGGGC GhGGGCGGG/\ GCGGCGGGGf; Ti]GCGGCrTG TTAATGGGCG lOO 

CGGGCGGCGrj CGGGCACGGJ' GGCACTGGGG CX'GCfiGGCGG IGGr.'GC-TGTG GACGGTGGCG '/Ai) 

g^gc'"'G(;ggg I ;(.;rG'';c;cGG(; gggck;gggga aggggggggg ggggggk^aa gggckx'CTgg ^oo 

gg":t" ",G']GG '^g:g'gg;'aag :;g::":; :A.g"g. ' G'./g. ;;" :a.:g: ; ''G'GGi^A'Jg; GGTcrj.'GGGi ; .^6o 



Gi^ -r: g;-:G';fg }-gi 



T 



wo 98/16645 



1K4 



PCT/US97/182I4 



G(r:':;t;TAAL:G GC"(nK:cAw("G 7u(^ca7cggc gijCC^gcctCK'g A(:;/\(;agggg> ' GGAc:i;;GGGCC 

GGG::CCAAT^ CTAACGGCGC AAACGGCGAG AACGGGGGTA GGGGTGGTAA CGGTGGCGAG 120 
GGGi:;GGC;Gt tiGGGGAATGf ; Gf;GGGt'GGGG G(^GAAGGGGG A^ iCGGiGGC ^(^G G FAGAGGCiAC 18U 
GGGG^XACGG GGAGGGGcV;': C'i ;aG(GKG;c;< ' AAGGGGi-;G'^ } 

; :g I ni'Oi^MAGi o:: i-g^k :g-;g G; :j : : i ; i : 

(A^ 1G:GGT:! : • ikk'-- ; i :.: :^ 



wo 98/16645 PC T/U 897/1821^ 

T'iG(.;cGGCA(; cgc^::(;gcca(; G(;tc;gggctg f;GGGT;^CGG(; AGCGCccGGr g:x:gacgccc 120 

CC^GCCAGCAG AGGTCTAACC GGTGGTACGG (^GTTC(:;CTG;G GGG(^r;CGC;G' ' QCCGTCGGCG ]S0 

GCCAGAGCGr; GAAGC";rrATT Grrr;crG;j-A Tr^AArT^-^r'-r- ^, 

' :g : r^^-^oRN^A'".: y-'R :'■•:(,' "i g :g':: a-: 



gag;:;-- /^AG^'A-'-rAGr- i/r 



wo 98/16645 PCT/US97/182I4 

ISO 



( ^c:;. -AAAC cc]''];^fj\cn:]c ;.;rAc^Ac:AACVi ccac'/acci^/ r:]r^r{.]cc''M?^ AC:'AC/\GGrr; 
gcga^:c;gcgc^ c;gccgCiGggg gccgc^g::;gaa gcgggggaag ggggc;gaggg gcgggcaccg 
gga^X'!:;c;ggg (x^aagaacxI' ^ aag(^o;cgc;ga /G;G:iG;;c;cA;' f ';.;ggg(;gaa/^ Gt.;G(;G( 'acxx; 



:'4 0 

30 0 
3 GO 

^ ■:.) 



.L".i.'..jr . :: : 



wo 98/16645 



187 



PCT/US97/18214 



'rTGGe:GG:;AC CGGCC^GCCAG GGTGGGGCTG GCGGTGGC(:;G AGCGCjGf/GGG (XGGACGCCC 120 

GCGCCAGi:.:AG AC^C^TGT/VAGG GG'1X;G7AG(a; GGTTGGG'I'GG GGGC^GGCGGf GGG(;iCGC;GG 180 

(■Gf'AGGGCGG rf\AC.{'C(:r^'':'T (y'^C:]i):'CCr-\ ';'GAAf V-G' -TG r:r:TGG':vGv - GGGGG^-\A'-G 2^9 

(.G : *G'"0HMAT1 GN ; g fJ.-: 1 / 7 : 

■ : .GG.GIKn :i: ..■;iA!v.--, ■ I KiG / G G.\G 
■, a; L1G:G7;; : ■•■ ■ i: - r i i : 



wo 98/16645 Per/US97/18214 

IS8 



INFORMATION i-'OH T'KO l^'^ NO : : 7 H ; 

(i) SF.QUKNCF. CHARACTERISTICS: 

(A) LKNGTH: 21 3B baso pairs 

(B) TYPE: niicleir acjd 
(f*) STFI/'N:)F:)NRSS : s inn if 




wo 98/16645 



PCT/US97/18214 



'ycG'^^'.vrcp^A a(;(:C)C>::a7(a; c'tcgc7'CjCco C'T^.~i(>agc;cc;'.; cuC^gctcc'^a" TvAiGggcxaai' i 3;'o 

TGGGATCCGG GATGGGGGGG (;GGGAATCGG TGCGGGCGG^ TGGCGGT'GGT GAGATT(:;CGC; l.:iHO 
GGTTAGGGGA GC;GAAGGGGG GG:A;GGGGGG GGGCA;r:v.;G,; AGriGGC^rGGG A7t XU. ; AATriG 14-^0 



wo 98/16645 PCT/US97/18214 



Lys Glu Arq CAn Arcj Leu Ala Thr :]cr Leu Arg Asr Ala AJ a Lys Ala 



6!) 



7 0 



7 5 



no 



Tyr r;ii] v^l As[) Gii^ A^a Ala Tl' r Ala Lea A:;j. Aan Ar 



C;!a C]\' Lh 



\ a 1 . . : a 

] :.^0 



Ala Lla /u ■ . 

1/ ■■ 



a. L^ai = ; . a C i V Aar. v',! 1 



af ' -a a 



an (■ ■ i [ ; A r a 



wo 98/16645 



PCT/US97/182I4 



:.^m; Tr.i .;-r 



Val Ala Val l,y:- Ala Ala Ser ].t.Mi (51 y Cly (.My t;l y (ily Cly Gly Val 

355 360 J6h 

Pro Sor Ala I'l c Lpu Gly Sar Ala I If ; y -ly Aia (]]\) Snr Vai Arq 

37 0 :^7s :-i«f; 



1 "a a ' 



Ala Li.l V 



1 iiji'a. 



I y /i.: .- 



■ ; J r: . r; 



. a:! ; a . ; a : 



wo 98/16645 PCT/US97/18214 

\')2 



A'^J A-;{: (Mi: Lri. A:.j A; ^ i V c; V^ii A-y, A:;i 

Cly Gly (My Arg l.ou Arg Arg Val Tyr Arq Ph.o Gir. GJ.y Cys Leu Va: 
11^ 120 I2b 

V.il V-i' i'h.' r;!v Glr; !M<: ^ci] Arg T : '. > Lulj ; ■ * I,<mj Arg \A, 1 

1 'M' : <■ 



■15 



; A ' r F ' : * - 
17'. 



: I J ^ ; r ■ ] 



wo 98/16645 PCT/US97/182I4 

1 



Jrp Thr :;er Mol Pro St:r Gly ""hi 

n ' 1 ' ' ' 



AAi Arcj Aci (Ay A:;:) Aly r.'.n A: -j 
Arg Prcj Gly Pro Arq Arg Ala Thr 

■ ■ ; ■/ Ar AA* ii : s Ai 



wo 98/16645 



PCT/US97/18214 



A'-; As-! A:-; A:') f ■■.:■] A:>;.> Alri {..Ui A:ip Ar(j /-rrj Vai 

'A) ^^ 

lie: Pho Ala Ala Thi A(n; Vul Ala Va 1 Ar>p Pro Pro Leu Arg Gly Ala 

65 70 7b 80 

G)y C-lly (-; : a A Aj /\:a : ."A r; I 1 >^ Asp A>."i Gly lAi ' Ay., a A.r(j 

•n[: 'A) 

AAi AA/ A' : \' A A:-; At a ''v A:. Aa A-.-: ilia ii : a Ar q iA /; A: a H:a 

l:..a rr> ] 1 0 

'-.y Ai,i ;-.:y y^ A:, J a.; ; Aj;; ,yj. ,j y^^; 



Ar .: 1 y> a. i J 



■ v',:; A:a a,v ar^^ .:-r .as:; a;.; .a r ; Arq Arc] V^i] 'I'yr Ara aiy 

■ ' ■ ■ ■ ^ ' ■ i 7 y 



1 



wo 98/16645 



PCT/US97/18214 



IMS 



AJa Glu Tyr Ar (j Asp 



Tnr- r>-p t; I y (i J u Ph'- Va 1 

7 0 'f[j 

Ar n Arc l.y:; Vai Pro I 1 ^- 
90 



r; 1 a Tfi: Arq Met Loa 
Va : Ara G] n Ar^; A], a 



wo 98/16645 PCTaJS97/I8214 



<—T'^r"Yr-:- : TTGAA' :AAC;A 

: 'ag(:a/\c;aa:. ' gg ^.^a^gtC';' 



■'I A.- ITT' 



wo 98/16645 PC I7US97/18214 

197 



AATcr;(';;';nr;T uCa^^atcaT'.' (^'^^C'J\{\GG(\^. i":G(VYG(:rpi': GCTGGCGr^'i'C] c:r:t;ATC'f;ccG I'jOo 

GAGOGATGAT GGACCGACGG GGACCGGCGA AGATCGTr^CT GGTTGGGATC^ ATGGTGATCG ]b60 

ct(;gggggtt cx;ggagg j'tg G(:gtttgc;t(; i (■c;c-:;rt;(;c7\ agcx^gac^ac: tj-agggattc ig^o 

■i'('Cc:gacc\"g g'"'T(";;;:'aa';/ at 'g;g;;a'] 'g^; g-:a ■ ggg -t* ; :;TGGA':GAi g 't.:ag I'Gtggg it,8n 

7. VvAGf 7iN^ 'A :;','!'::(.:;"■-::';■ 'ArrGA': ■ rr. 'v ■ ■."A G'tagc^ ih^gi 



A- :.i,GGG;:: ] '^r ■ t t .n ; 



. 1 ■ . I 
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^ ■'■■-;tvt7.;.; ;:';.v < ; 'a:^;A'.'- ; AuAa: a: ;:;('gaca;at s.^; 

C;'\A"A ■'•■^Ai A' AC"'";' ;<"AA7\ 'A rAA^'TAr'^Ar ; A'IA"i'; 't:'!'-/:: ; AA'r^^; 'f^ f Ar:TAi"i"AA "TA CCi'' 

cacca^i:ggtt gatcgatcca; tgggataa':a; atgca:gccg7 ttaacgtaac agctttcgcg 660 

GGAGCGGGGG TCAGAAGGGG CGAGTTirrCG GGCCGGGAAG CCGGGAGGTC; TGGC;GGGCGG 72 0 

Ti.'ACGGGAGG TGAf/TrvArc;:' A' •GATG' Alf^G TG'IT:GG'rGAG G'TIA^GAAGTA AAAGA^AGTr" 7h^; 

AA.;riGr'AA^"fi GTAA '^rrAG : 'A- Vv'"'GAG' : ' AA' G AAN: 7A ' G-CAAT(^ C:G'rAC^";TT]V' ^ 

- A :-f.,A. ^-i ■.: i : :'.:g;,;v. G' 't rr;.A/..': v ; iGAAGGc:GA': 91A1 

^ TA ■■";Ai' ' a: - ^-A'-^AGIAA.; A A \ ; TG. A : ^ ^r::Gh^]:Vr']:VV ('( IGTGAAGGC^ MhC^ 



wo 98/16645 PCT/US97/18214 



{XI} r^v.QUF.hlCK UK.^CR] i'TTOfJ : SKC.) ]M T-'O : 1 8 b : 



■ ":'Ar-:c^;';A 



f 1 n 



:ac:aa(^ 



wo 98/16645 PCT/US97/182I4 

:()() 



(xx;c:cT::GrT cTTCccAJhc 'rcccccGccc ccr;c:rTc:GAc c-GCACiXx;;:c ctglx^gtgtg 

TCCGGGCTGA CCACCGGGAT CGGCGAACGA TGGGAGATCA CXTC:GCAAT':5 ATCCACCTCG 

^'GCAGCTC^GT CACGGAGC:ga ^TT^GGGGGi'G TG''(;AGAG'':G Gr"T.;GATCv-^.^ GTTC/n'AT^r; 

GGGTGGGG(Y' rxVv ;GGG: -a:: GAAG'I'TG^'AG TA^'TGGr^: X:A (:GAG''TG;:Ti A. 'rr;c-;AGGC; 

gaga/\g'['tga gGv ;'iG/iAG- ■ -"'i :Gr/\':\.;T: ■! :A' ;< ;'t Arrr'i'GAG'"' ; ■ ;aaaa' "f 7\g/\ 

. ■ ' ''A ' ~G':;cA g; A'GG^i - 'Gr;i.; 'gg '':;gcag a.a;agaa/\:'G gga^ ;g>.\;g'-^ atacuITGAC; 

■ i.y.:: : A! : ^■/i h tr,-' ;m ■ 
GA TVIGA :r.-A-::.- .-G.i 

GGnA*G)F:r^::F:G:- ; ^ : A: 



wo 98/16645 



PCT/US97/I8214 



i r ( 1; 



][ Y L c u A I a '"i 1 y : . y s G 1 y I. y j 
20 



:n lie A:-:n 'I n r 'V)::' Leu Aim oer Leu 

3 0 



S(>r Glr: Ala L^.-a A-;i /VJ m I. -a: 



■Ti r; 1 u ''My Arq My Asp Pho Pho Ale 



M : a ^ : 1 i. A.q. Aa; 



m; /-a: Aiq 



1 ■] : 



i-la. Aap 



^rra Laa: I., 



fa' raa 



1 ■] ' 
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iily '?\:: Pi:' (;;y i'lc^ Pi o A^;n Ala Tyr /W:f.' (.j 1 u Tyi Vro 

310 31S ' 320 

Va 1 Lou Pro Pro ilc Gl y Lou t.ln Ala J^ro Gin VaJ Pro JJe i'ro Pro 

32S 330 33S 



3 



\: ;\:;m PL 



;ii: P:io i.fu 



: 1 .) P: 



■1 ,'. P: P 



^ o 
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I'h- A:.:i I!:, S-r \i ['.•■ A i ,i Tii: AAi l.y., \' ^ • [ A 1 -i 

^^V. oOO 60S 

I'hr Pro GIiJ Ser Gly Aia Gly Arq Gi y Aid Aid VaJ Asp Pro Sor Sor 

610 61b 620 

Pro A: (mm Tl.i A:.wi Pho Aid A. Pi Gin P^mj P'-u H.i<"; P^-j G(>r 
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) l-> I/: 
: Ni'ORMATiON yon [;y.o id Nori'U: 
(1) SEQUENCE characteristic:;: 

(A) LENCTi^: 8 0 imino arnd.' 

(B) TYPE: .imirio acid 

(C) STRAN:JF:DNEr:', : 

( :c T( )r;-:.oc;v : : ; lu-i r 



wo 98/16645 



PCT/US97/18214 



A i 'v' a ; A ] r.i } ' r o G 1 r . 

bO 



Pro Gin cau Vai VaJ Val Leu Ala Wis His Lgu Va 1 Thr Gly Thr GJ y 
65 70 7') 80 



(Au V,.] .Ai. ^Ay 



.'-J ! A; -p I'l t 
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(A) LENGTii: 82 1 ba^ie pairs 

(B) TYPE: nucleic ocid 

( C ) S T R AN D E D N E S .S : s i n q 1 c 
(I) J TOPOLOGY : I i near 




wo 98/16M5 
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: ■7:-, vx;rv;A': ..; 1 v " a"\,a; ■ i- ^;wv'i^r■■ v^(;T'v;;-;vv,r; : 1 ;,v\(.;atatc(; nr> 

I^Al:TTTGTuG TC:C(X^GTGGC GGGATAGAGC ACGTG'I'CSGG GTTGGTGAGG GTCACCCGTT 120 

CIGTCGGACGC CGAAC( :CA7'c; GTTTG7viU;G 1 AG;.x:7'GTG(;G TGACAGAAGT GGGGAGCGTA I 80 

aggtgacggt cAAA'^'A'rr'G^' GTGGAATTTc ( ;g : 'r;T( ;a( "f ;t TGf'(;;Gr(:r;r\; gagaatcaac; ?a() 

GGA'rA^.:TrAG G':'A^"VGG-GGA 'AGGTGG/. GG ^GGI^-AT : V-'^-'TTGGG : -TG<;TGAAGG U)- 

■. '^V^'^GGAl "AA IGAAG.' V'/;. " ' ~GG7/\' 7G^* 'G : "GGT ::r[.;G.' ;a GCG^GGGT'SG" GAGG7^GGTr:c; 470 



:gatt 



IGC;. i'-G rr'r,A 



\AGAGAGG'- '1 AT: -^fGAGAAT 

;A'"'A'r7''':^" GA:.''"rGA/iGT 



wo 98/16645 PCT/US97/18214 

:()s 



cr\ci;::Acv;-r i\r:\]cc': ::(:(] ^:(:f:ACA:ccjr ..'gacaccaag cga.;c?c;ccg ccggagocac 300 

CATCACCACC TACGCCACCG ACCGCCCAGA CACCAGCGAC CGGGTCTTCG TGA7VACGTCG 360 

CGGTG(XACC^ ACCGCCGCCG Ti'ACCGCCAA GGCCAGCGGC /VACCCCGGGf; CCGCC:AT(;CC: A?0 

(v;rT^c:{XTv:r r.-;::GTrr;rv\:; ^ :c(v:r:]:::-Gr ;r':*x'GGAA r-AA^ vx/^cgg^- ':(\]g<::.]:^cg(: ^ti^^ 

'■ 'G-'' 'GGGAT- ' AG- vv.''.':G":: : .\ 'G. ;A' 7; . V,';'-,;:-;':'G.; GGG^ 'TGGG.; .; T:"'TGGGt:7'G7' oOO 

■.V^AA'G'G'G:-- GA^G7- 7iG:-.-^ 'Gri.<G*Gi.:-.'., -(';;(■:(; rY-c^-rr-r]-;; C^f rry- ;':-(^aG 660 

_ 0 

t^a/v/'lC-gcgg' Gv'GTG'GiCgg'; gg("^^'ggg':'a:;' ■.'A'.'G'G'7A'"r G'''^:':;;"/":cgg^7 A'''ag'";;T'!'gg oo;^ 



w o 98/16645 
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?()9 



ccGCArcGTc; TCCc;cAc^c(;c; ccac^jtc^a::'.: ^rTTc^c^TCwCr (;(";c7'c;ck:gi,c g^^tgactatg 

GC(X;(:GACGG rc:GTG(;'J'TG"r T^ATTCGGGGC GGGGCGCAGC rTGTTGCGGG AGTTf;AAGAG 

GGr;Ai;GAGA(; GGGc ;AG^' ■"rc^ - :'::'7AG AGGT G'.;('TGAAGTG GGAATr;^:A . GIXX ~( ;OGr;GG 

agat(;aa7ag (;''^;;ga^.'G'/G' : aggtggt': ■ 't t 'C':(M\G'iCC' G':'i:Af'':'GA;. Ai:'TA;;Ac;Gf:'A 



1 M 

1 98 0 
: ' n 4 (J 
7 1 o:. 



"I vg:T' 



."GAGC; 2? HO 



;Jr' ;|<^:A'r ION 



. i . 
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?]{) 



Mcl h]<i AJ a Arc; fW a Ala Va ; Ala AAi G\y Lou Mr- 71 w A^p Ala Gly 

l.iO ].;o 

H:s Cy.s Arq Ala Val Pro Gly Ala CA y Arg GJy Ala Gly Arg Gly 

Mi) :S0 1^)5 ' jf-Q 

iii" A^.f Va.l. Gy^. lao GG/ Gli: Ala Gly Ala Aia Gly Th:. 'I'tir Giy 

A.M AG.t :G-t a:,, y,,; ;;:r: rr.. 'Gy . Ala V.i ; 'rh: Ala >wq ^I'hr 

GG 

^ ■ ' A..: :Gv A, : Gv A. ; Gla A'y- GL: AGi GiJ AGi Aici 

.'■G' .G : ,■:•.) 

'•A, ;'.G} ;:.G^ ;>/^Q 



1 



I 



wo 98/16645 PCT/US97/18214 





VV O 98/16645 PCT/US97/18214 



TT' 'AA7A"'</: • ■: ; A ; : ;aa/\ aa'a ;aa':a aa 'AAT( aa'; aa';'' 'a. ^ -aa^ a> a'aA' -aaac' aaa: 

TAAAAAATc-i; c; ATCA.-( ;aa;\a t: : 'Tcr;r^c^ atc aac/j\catci; GT'rTTGc;AAA cgtca:;c;(:(;aa i'aio 

TAAAACGTCG GCTTCGGGP^A ry:,CGGGCGr\C TTCA/iCCPJiG GC7TTGCCAA CACCGGCAAC 15 AO 

aa(:/\acatcg GGT'i'i ;GccA;i c^accggcaaa aagaacatgi, c;AA'r :x;g.^at gtgaggcgag 

AA' ^'AGCAC;; ; r;f;':?r"-AA':'A*;' TGATAG^AiA" AAGAAATf'GG A' ^AA( 'GG. 'A/^ '^'AG'' AA AA^TC^ U'A^n 

m 

rjGG']: rz-cfv: gg(;ataag(^\ G',gg'^agaaj g^-."'at'~gatg ■r'^'rGG^;r'''A'A c^^^a* A:AT'i'G :g 



wo 98/16645 
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./) .:7Kav:'k::n]:::;;- : 



.1 Tyr A:;:, r;]; 



■ : ■/ A I 



w o 98/16645 



214 
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C;.!n Tyi- A:>n Thr (Uy Tyr I.eu A:;i: y,cr Cily Ar^u Ty: A:^:: Thr GJy Leu 

260 ;'65 270 

Ala A.sn Sor GJy A.sri Val Asn Thr Gly Ala Phc Hi- Tbi Gly AbT, Phe 

77^> ?R0 28.^ 

Am! Giy L-i; Trr Ar^i .-My Asp Gin G:y G-'i i\c i^h^- Giv 

:gG' gg.) 



T:.i G.i.: JG y Th^- A:;r: K"' 



Am. ■[ : G ■/ G^t A:-r, [■l"t r; ; ^ p}^^. 




wo 98/166*15 PCT/US97/18214 



^V. S6( 



The Asn Scr Gi y Thr A.sn Ar,n Va,! C; 1 y I 1 Phe Anr; A.l a Giy Thr GI y 

bbb 5 70 

A!-: Vai (;■■/ A:,. A-:: ;;-■>■ uiy '"n r A.;r: < ; : y 1 <]'y Ar^n 

'?^it^ '-r^^^ 5=^0 



• 1 y [; 



Asr 
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H;-i') HA'] 

Tip Aid Asu Leu Gly Pisn th-i Va.] r>cr Gly Fhe Phe A:in Thr ^)cr 7'hr 

050 B5o 860 

Vai Asn Lou Sc^r 'i'hr Hrn /\ 1 a A.'iri Val Ser Gly J.ou Asn A;:n Ilo G] v 

Bob fWtj r,7S ^^^.['^ 

TrW A5;r. .:.;m .,ly -.a, ^ {;,.. - :;;y i.^-,; -pj^^ , ; | , , J.,, 
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^H) TYPK: :iuclei.(' dcicJ 
(D) TOPOLOGY: linear 



: J : ■■ : ; 



wo 98/16645 PCT/US97/18214 



(xi) f;i::OUh:NCr. DKSCRIPTJON : S^:0 id NO;20b: 
CGATATCTGC AC.A.ATTCAGC: TTTAAAGCCC Aj'TTC;CGA 
(?) ] KrOHMATT ON 1-.^^ f^Kv .] I * rv;):;:nt-, : 



wo 98/16645 PCT/US97/18214 



CTTTCTCGCC A'Xn'TrCCCc; GCTTZCcaw TCAA<:;CTf;TA AAT(.X^0;(;GJC TCCCTT7^AC"G 180 

GTTCCGATTT AGTGCTTTAC GGCACCTCGA c:CCGAA7JW\ CTTGATTAGG GTGATCGTTC 24 0 

ACGTAGTGGG GCATGGCCG':' GATAGAr:GGT ';'rTTrr;rr;-r TTGAf'^GTTG^; AGTrCACGTT 300 
GT?TAATAGT GGAf":'!': 'TIG': : 'AAA^ '7; ; : ■ /^A^'AA* "A' 'T ' AA^ "G ."GAG^ 'T ■ 'CiGTC'rAGGC 

t?tt(;at7':'A ':v\aga(;a?'g:- tg""gat': tg : igc'tattug tgaaaaaagg agggc;attga -i.'c 

■G^G.;-:.-:AAAT -T: ;G7r-r;r;AA rr:-':f:[-y": : ■■-'yjY:T'-r':'^ TAAA'1GG'AG7 -'AAATATCrrA S.^.) 

T ^■G:"7^G7 AAT'i'AAiG' -7 ':A7AAAAA:"i : •A7l:-- :;7GGA7 G/w\A'l 7AAA^ ' 7G7AA7TTA^r t.l)7 

'7."7^7A':■G;v 7; A7TA7( V77:'A ■77^7Ar7'7r7 ; G^AAA/^-G". \ ■ r '77:"" r7'" A;..'i' 7AA77AGAAyi t.bO 

7:77G\AG;'',77 ;G\7 AG^G^GC: G A77AA7r7':'' ' ' '7^;7; GAAA AA':7GV77i7A '.G^^AGGCiAC Gi /MQ 
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220 

A- ■.: .;r(;Ai A a^tcw?:;'! ; 't 'rA^A;i;Ai.r:": v: ..".A^^rcAAAAA i^ataai'Taa- ^ - iataal;*;;^'' - : -of! 

TAncr^GTCGG C^C^TCAACGnO GGG7TCGTGC: ACACAGGCCA GCTrGGA(X:G AACGAC^CTAC 18hO 

ACCGAACTGA GATACCTACA GGGTGAGCTA TGAC'AAAGCG CCACGCTTCC CGAAGGGAGA 1920 

A/^GG.JGGAGA GC^TATCXX;:"/! /iAv^GGGJ.JAGC-; GT^G ;gAA':: AG rV^GAf- 'G^G/\' ' GACV:^:;AC'^G TT \360 

■> G ':"j'Gr/V'/ GG'1G\-GG> .(;G ' -riG-rGG':'-' ^ r. ■■[■']"rT:-,-rr^ A'^h^: VVT- '[ 'l' ^(-y ^."-CVVT P-. -^'rn 

v :tc;att ^'^iggggataa ;' G;TATrA"':" ;:'^"G'i gg^ ;ag': f :AGG'rGAT*.'*GG^GG^C'^G"'G'' :ggg) 

TATT-'GG'l'CG' ;'GAGGG/iG: ■': GGG::-: ■:;GAGG TGA^ "AG- 7* TATAGGGG.. , ; A'"^GG':': ' Ai TI'/v .\G10 



'Tga: 



wo 98/16645 PCT/US97/18214 
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r.r.GCAAC.ChG CTGACTGGG'l' rGAAGC;CTCr CAAGGGCATC (;GTGGAGATG CCCiGTGGCTA 



CCTGTCGT(X CAGCTGGATT AATGAAGGGC-i GGAACGCXX , (;GGAGAGGGG GTTTGGGTAT 



T(;Gc;rc;f.:'"AG r^'^T' g^':': t'g: G'r''GG':'AGGA -ggagagg 



f \ r\'r 



AGACAGAAG7 TAAT(;GGG:^:' G ::TAAC:A(G\. • GGATTTC^^'TG GGGAGGC^AA'! Gr'GAG(^AGA': 



m;g:.^ 



ATGAGTGAGC TAAGTTACAT TAATTGCGTT GCGCTCACTG GCCGCTTTCG AGTGGGGAAA 3h4 0 



.^600 



g-:'Gv"Ac:ggg ggtg';\;g('- 'g AGfr^AGGGCiAA 3'.';:() 



'1 0 S < 



atgg-;tgg'-t ,11^0 
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mi 

;^\?rAAT/\;^;; A'"TrA('TATA AAi ;>.;aatt-':' -a:- ;a7AA aaa7"A'a^ ''ta^' aaata;^ ^-jac 

:'77TC7rTTAA crT'rAAC;AAv; OAGATA'l'A'. 'A TA'l'.^GACCVv'I CATCATCAT'. ' A7CACGTGAT b ; j C 

CGACATCATC GGGACCAGCC CGACATGCTC^ C^GAACAGGCG GCGGCGGAGG CGGTCCAGCG 5160 

GGCGCGGCAT a:;ggtgga^^g acatccgcgt cgctcgggtg attgagcagg agatggccgt ^■>2;^o 

GGAGAGCc;rr G(^CiVAC/\7CA cgtagcg^:a 1' gaa';;c7'Cgaa c'rGTGnTT:'A agataaggg: ■ ;.,:ho 

■ iGGvK:AArG-;j A-;ggu::T' ;a /g^g(^ag(":;a'; ggc'tt; 'gc":t >SAAAt.";GGG^":>.5 cgg^sggggG'^; ')3A0 

':'A■::TC■T^A;G:; a::':'A'-'"'."''" " ' '-GA'CTC^ir : y/]": ]r/^y:'u;^ sgggaga';:'-.; (JTAGGAGGG'!' ';40!"} 

G:.:'j'7iA; x-; CG'^Gr rc/vAf t 'r; ;( ;;;rG'Gf' 'ttt^'A" < ;a; .At-iGTA': ' r:r;AAf^' ttt^a^^ ^i-n.r^ 

>*AAGA'i'T c:G"Tr '.'";a: ■■ : ^ :gta';^''T^1': ' /r-;,/-y-^-':''w^:i at i i^'^'G ■ : ■ ;'.c'Aa:;gc;g^""; 

ggagga',:gtg a.a';(":'\-aa^ ■■. ga/^aa'.'T'::";' •c";G'"'VG'7 7A':.'*:a':c-G':"a ggat^aaaag ^■G)0 
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C'k^c(]'::::Gcr AACACCxXiA ATc;:^i:.:cAr;c'r (;v;r;;:GATcx'C aacgcagcac ctcc :;cccgc 

CGACCCGAAC GCACCGCCCIC GACCTGTCA7' TGCGCCAAAC GCAGCCCAA.C CTGTCCGGAT 

GGACAACCCG GTTGC^AGGAT 7't:A; K:TTCG(; C>(:TGCC^TGC^T C^Gr:TGGr;TG:'; AGTCTGAGGG 

CG(;'^GAc?':';; gact/igggtt *:'Af ;c/^gtv''.^g gac;gaa/\agc agggCjC^c^agg gg("ga':^ttgg 



. / '.L I AC ■ \'''j^J'\j\ 
y ^T^V,;;Ar-Ar 
!■ 'TGCGTTGA 



() 7 8 0 
6840 

6 90 0 
(j hO 

7 02' i 

7 j -1 G 



■gt; 



■ :a'i ■ 'g;;;^ 



717,7 ;.;aT' 
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224 



PCT/US97/182I4 



■•'vir A:'!' i:^^ A:-:} 1 A?"'.j V.il ll^- w . ij A.;p Mr^t 



A.Ki Veil Asp Sei A 1.1 w]y I.ys I .1 o Thr Vyr Arq I i (; Ly:^ ]a-u Glu V<iJ 

50 60 

Ser Phe Lyfs Met Ar^ Pro Al- Cln Pro Arq Cly Scr Lys Pre Pro t]er 

65 70 75 HO 

'■\y Pro 'Pu T'.r :Py Al i My Ai.i (Ay Plir \P; : A... TPr 'iTo- Pin 



wo 98/16645 



J •} 0 

Cj 1 y Pro Ala P r o As p G 1 y 7 y r P r o 
V,-i ! /.s:, As;i Ar : i.ys /.l 



PCT/US97/182I4 

A.:ii I.; 1 :. A . ,1 1 J .i.M : I Asp 

3'^..- 3 0 0 

I J.? lie Asm Tyr Gi'^ Tyr Ala lie 
3 65 

Ai a 'I'll I. A.J G 1 M 'I hi :am. Gin A ! ri 

r: : ' M ^ V,; : ; ; ,y:. p..ni G».; : A^jp 
A. . i . r:- ■• : v.: . :r. ;-:p A: .> A A. 

.-■■r A: I A 1 /I g: y J--:-: Gin Gly 
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226 



PCT/US97/18214 



r ■ 1 O 



Arc] Me Veil Leu Gly A: g Liy.} A;^p G 1 ii I.yf. Leu Tyr Ala S^l A.l.i Glu 

^'j^^ C^50 G55 

Aid Thr A:;f:' :;or Ly.s Ala AL^j A: a Arq Lea Cly ^;or /u^p MtM. (; I y Glu 

bb^; f,7C 



"■/I ia^; 7y: la > ) G!y 'I'n: Ara J-'" /^ar: Gl: 



f . a i I ; !" va ] I S f - 



:r .a^ 



a ■ 



7^0 
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CLAIMS 

We ciaim; 

1. A polypeptide comprising an antigenic portion of a soluble 
M. tuberculosis antigen, or a variant of said antigen thai differs only in conservative 
snbstitutions and/or modifications, wherein said antigen has an N-terminal sequence selected 
IVoin the group ct)nsisting of: 

(a) Asp-Pro- VaI-Asp-Ahi-Val-Ik'-Asn-Thr-Thr-C\\'s-Asn-ryr-(iIy-Gln- 
Val-Val-Ala-Ahi-LcLi (SflQ II) NO. 115); 

(b) Ala-Val-Gln-Srr-Cily-Mct-Lcii-Ahi Lci; (iiy-Tlir-Pru-Ala-rro-Ser 
(S1*Q ID NO: I 16): 

(c) Ala-Ala-Mct-Lys-Pro-Arg-Thr-(;iy-Asp-(ily-Pr()-Leu-CiIu-AIa-Ala- 
Pys-(iIu-C}Iy-Arg (SHQ ID NO: 17): 

(d) lyr-'ryr-Trp-e;vs^Pr(>-(}!y-(iIn^i*ro--Phc-Asp-Prt)-y\la-rrp-(ily-Pro 
(SP:Q II) NO: 118); 

(e) Asp-Ile-(]ly-Scr-(i]u-Scr- l hr-(ilu-Asp-C;]n-(iln-Xaa--Ala-Va] (SHQ ID 
NO: 119); 

(fl Ala-(Hu-(lIu-Sci-Ik-SL-i-ThiA'<ia-(ilii-Xaa-IIc-Val-Pr(i (SPO ID 
NO: 120); 

(g) Asp-Pn>-(iki-Pr(>-A!a-Pr(i-Pi(>-\'aPI^o- ^llI-'^lH-Ahl-Ahl-Scr-P^>-Pr^>- 
SeI■ (SIX) II) NO: 121 ): 

(h) Ala-Pro-Pys-Thr- l yi-Xa;i-( ihi^( iiu-I eu-i ys-(il\ -'l hr-As]v riir-( ih 
(SI'O ID y. 122): 

(It A>p Vvo Al:i Scr- \Ia Pro \-:p Wi! Pi.^ I in- \la-Aki-( ilii-j ^-ii I hi-Scr 
1 cii'l cu-Ami-Sci -i .'u-ALi- Ami Wil Scf Phc \Ia-A'.;i (SI (2 

il) NO: I2;m: and 

nt \l.i-I*f o-( i]u-S.T-( i!\-A!a t iK -! cu ( \ il v- i hi A al-( ,hi-Ala-( il\ ; 
iSf (2 11) \ u ! ^1 ) 
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2- A p()i>'pcpliclc comprising an ininninogcnic portion of an 
M. tuberculosis antigen, or a variant of said antigen tliat differs only in conservative 
substitutions and/or modifications, wherein said antigen has an N-terminal sequence selected 
from the group consisting ol": 

(a) Asp-Pro-Pro-Asp-Pro-nis-Ghi-Xaa-Asp-Met-fhr-I.ys-C}ly-7yr-Tyr- 
IVo-(lly-C;iy-Arg-Arg-Xaa-Phe; (Sf:Q ID NO: 124) and 

(h) Xaa-ryr-Ilc-Aia-'lyr-Xan-Thr-'J'hr-A[a-C;[y-IlcA^aI-Pro-C]Iy-fys-lle- 

AsnA^al-liis-fcu-Va!; (SI{Q ID NO; 1.^2), wherein Xaa may be any 
amino acid. 

A polypeptide comprising an antigenic portion of a soluble 
M tuhcnulosis antigen, or a variant of said antigen that differs only in conser\'ati\'c 
substitutions and/or modifications, wherein said antigen comprises an mnino acid sequence 
encoded by a DNA sequence selected from the group consisting of the sequences recited in 
SfX) U) NOS: 1,2, 4-10. I V25, 52, ^4 and %, the complements of said sequences, and DNA 
sequences tiiat hybridi/e to a sequence recited in Sh;Q ID NOS: 1. 2, 4-10, 13-25. 52, 94 and 
or a complement thereoi" under nuHleratch' strinnent conditions. 

4. A polypeptide comprising an antigcriiL portion df a M luhcrculosis 
antigen, or a \'aiianl of said antigen that dilfers onl\ in conscrvatne substituiitins and'or 
modifications, wherein said antigen comprises an annuo acid sequence encoded b\ a DNA 
^;L\!i]cncc selected \vo\\\ the grou^^ consistiiii' ot tlu' sequences reeiieJ in SVi) ID NOS: 2fi ^1, 
l.vv I-f l>S r'S and the eunlp]ement^ of s;iij veuiienee^. aiid |)\A ^^eqiiences that 

ii.v;^: uii/e li> a seqiienee recited i:i SI (,) ID .\()S: .'d -.^ I . fvv i-;, I >N ] and or a 
conit^lcment ttiercof under modcrateiv slriiment conditions. 



\ I)\ \ 
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0. A rcconibinaiU expression vector ct>mprising a DNA molecule 
according to claim 5. 

7. A host cell transformed with an expression vector according to claim 6. 

8. '^rhe host cell t)rclaini 7 wherein the host cell is selected from llie group 
consisting ol7';. coli. yeast and mammalian cells. 

A mt'thnd for detecting M. tukcrculosis iruCxtion in a biological 

sample, comprising; 

(a) contacting a biological sample with one or more polvpeptides 
according to any of claims 1-4; and 

(b) detecting in the sample the presence of antibodies thai bind to at least 
one of tiie polypeptides, thereby detecting M. tuberculosis infection in the biological sample. 

If). A method for detecting M. tuberculosis inlection m a biological 
sample, comprising; 

(a) contacting a biological s;iin[ile with ;i pol}[x*ptide ha\im: an N- 
(crnnnal sequence selected from the unnip coiisfsimu of seiiuences piovided in SbXJ ID NO; 
129 and 130: and 

(b) detecting in the sample the [presence of aniiboLiies thai bind to ;it least 

one of the pol\ pe[Midv'>. iherelw detcclmL' ;/ tuhcriiii>Ms inlcclion in (he biolomeai sample 

11. .\ method j.>i delcclnij: M tuiH'>\ ia<>^ : ^ itiloction ni a huAo^^\cA\ 
sample, comi^nsmii: 

(a) eoniactinr a bioio;'k\il s.unplc witli one nr more poKpeplides eiieodcti 
i\\ A n\ \ SL'i{iieii;e --K-chvi Irom \\k j'nuip l^himsIui!' ol S| (j Ij) \()S- V II 1 \ 1 1 "Wi. 
h^ M ^ \ I K.M SS. i \'y^ mk\ I '^S. I tie . ^ MiMiieiiienis J m\imi j-J i )\ \ ■ mi ^ 
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(h) detecting m the sample liic presence ot' antibodies that bind to at least 
one of the polypeptides, thereby detecting hi, tuberculosis infection in the biological sample. 

12. The method of any one of claims 9-11 wherein step (a) additionally 
comprises contacting the biological sample with a 38 kD M. tuberculosis antigen and step (b^ 
additionally comprises delecting in the sample the presence of antibodies that bind to the 
38 kl) M. tuberculosis antigen. 

13. I'he method of nny one of claims ^>-l 1 wherein the polypefuide(s; are 
bound to a s{)lid support. 

14. Tiic method of claim 13 wherein the solid support comprises 
nitrocellulose, latex or a plastic material. 

15. I he mcthtxl of an} one of claims 9-1 1 wherein the biological sample is 
selected from the group consisting of w^holc blood, seiimi, plasma, saliva, cerebrospinal tluid 
and urine. 

16. The method oi" claim 15 w herein die biological sainj)le is whole blo(Kl 

or serum. 

17. \ inetliod lor deleeliui' 1/ iut\'rculnsiK iiitection in a biolomeal 
.-ample, eompnsing: 

(a) voiilaeliiig tlie .sanipJe \Mtii at lea.st olijionucleolide fM-irnei"> ui a 
polynier.ise (.liam re;ient)n. wherein at lea.st one ot the oligonucleotide pi'iincrs is s])ecilic lor a 
1)\.\ molecule aeeoidiiu' to el aim aiul 

(hi deleelMii' in ih.' sainple a ! )\ \ NLnjiuMiee lli.il aniplit'ic^ in ihe piesenee 
ihe oi]L'oniieie<UiJe i^iniei:. 'heiei^v Lie!eeiin:' '/ '/< 'v m f^/* .'^/ s ifilei tioii 
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18. I he rnctiiod ot claun 17, wherein at least one of the oligonucleotide 
primers comprises at least about 10 contiguous nucleotides of a DNA molecule according to 
claim 5. 

19. A method for detecting M tuhercidosLs infection in a biological 
sample, comprising; 

(a) contacting the sample with at least two oligonucleotide primers in a 
polymerase chain reaction, wherein at least one of the oligonucleotide primers is specific for a 
DNA setiuenci' ^clfrtcd fr^Mi^. the group consisting of SI-Q II) NOS. 3, 1 1, 12, 135, l3(), IM- 
135, 184-1S8, 194-195 and 198: and 

(h) detecting in ihe sample a DNA sequence that amplifies in the presence 
ol'the lirst and second oligonucleotide primers, thereby detecting M. tnhcrcidosis infection. 

20. 'I he method of claim 19, wlierein at least one of the oligonucleotide 
primers comprises at least about 10 contiguous nucleotides ol a DNA sequence selected from 
the group consisting of Sh;Q ID NOS; 3, 11, 12. 135, 136, 151-155, 184-188, 194-195 and 
198, 

21. The method of claims 17 or 1^) wherein the hic>IogicaI sample is 
selected from the group consisting of whc^le blood, sputum, scrum, plasma, sali\'a. 
cerebrospinal lluul and urine. 

2? \ nicthod to[ vlL'k\'t]iii' \! '7 ■/\ ///os ;s mlcclio!! m ;! ^Mohii'jcal 
-^,111 ^oMipi ismg: 

(a) coniactmi' the sanipie with t>nc or moic ohgotuiclcotidc pi'obcs spccilic 
lot A I )N,\ iiiolcculc according lo ciaiin ^. .iini 

(b) Jctectiiig ill \\w ^AU\y\c a 1)\ \ M\]uctuc lh;il Inbiidi/cs i.» tiu' 
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2?>. The method ot" claim 22 wherein the probe comprises al least about 15 
contiguous nucleotides oi^a DNA molecule according to claim 5. 

24. A method for detecting M tuberculous infection in a biological 
sample, comprising; 

(a) contacting the sample with one or more oligonucleotide probes specific 
for a DNA sequence selected from the group consisting of SMQ ID NOS; 3, 11, 12, 135, 136, 
151-155, 1S4-188, 194-195 and 198; and 

(b) detecting in the sample :\ DNJA sequence that hybridizes lo ihe 
oligonucleotide probe, thereby detecting \f. tuherculosis infection. 

25. The method of chum 24 whcrem the oligonucleotide probe comprises 
at least about 15 contiguous nucleotides of a DNA sequence selected trom the group 
consisting ofSHQIDNOS; 3, IL 12, 135, 116, 15M55. 184-188, 194-195 and 198. 

26. The method of claims 22 or 24 v\hereiu the biological sample is 
selected from the gnnip consisting of whole blood, sputum, serum, plastua, saliva, 
cerebrcxsjiinal fluid and urine. 

27. A method for deleclmL! \1 tuberculosis infection in a biological 
sample, comprising: 

(a) coniacting llu* biolo<:]cal sanifMc a birklini' ai^enl which in ;-apable 
of bind III!' 1(1 A [^ul\prpt idc according to an\ one ot ci;nin> 1-1; and 

(bt vicleciiiu' 111 llu vKUjik' a protein oi f>o|\ pcfMuh' ifiat biiuls ihc 
iMiidiii!:! agent. iheieb\ detectiiu' \/ ////v7Vf//os/s nirt.'clion in ibc bioiogicai sample. 

2S \ inelbod lor dL'leciuiL' //."\ ' . /.\^ o. s nlav tion i;i fih^it mji^-jI 
-ainpiv'. o un[n"isnTa, 
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(a) contacting the biological sample with a binding agent which is capable 
of binding to a polypeptide having an N-icrniinal sequence selected from the group consisting 
of sequences provided in SHQ ID NO: 129 and 1 30; and 

(b) detecting in the sample a protein or polypeptide thai binds to the 
binding agent, thereby detecting M. tuberculosis infection in tlic biological sample. 

29. A method tor detecting A/ tuberculosis iiifcction in a biolouical 
sample, comi)rising: 

(a) contru'ting the b!(^l(^gical sample witli a b.iridiiiij; ayeni which is capable 
of binding to a poh'peptide encoded by a DNA sequence selected from the group consisting 
of Sb:Q ID NOS: 3, 11, 12, 135, 136, 151-155, 184-lSS, 194-195 and 198, the complements 
of said sequences, and DNA sequences that hybridize to a sequence recited in Sl'Q II) 
N0S:3, 11, 12, 135, 136, 151-155, 1S4-18S, 194-195 and 198: and 

(b) detecting in the sample a protein or polypeptide that binds to the 
binding agent, thereby dctcctmg M tuberculosis iniection in the biological sample. 

30. The method of :u\y one of churns 27-29 w lierein the binding agent is a 
monoclonal antibod}'. 

31. Ihc method of an_\' one oi claitns ?7-^^) \\\]vic]n \hr binding agent is a 
pol\clonal antibody. 

v"" A viuiLMioslie kii c^iMifn'isinL': 

(ai one *>r nuMC [^oI\ pofMuk'N ai\'(*i\ii!i'.' to aiiN iM\ !aiiii^- 1 l:aTid 
(hi a Jetection reagent. 



\ JiarnoNtu kn v-omprisnu' 
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34. A diagnostic kit comprising: 

(a) one or more polypeptides encoded by a DNA sequence selected from 
the group consisting of SEQ ID NOS; 3, II, 12, 135, 136, 151-155, 184-188, 194-195 and 
198, the complements of said sequences, and DNA sequences that hybridize to a sequence 
recited in SI{Q ID NOS: 3, 11, 12, 135, 136, 151-155, 184-188, 194-195 and 198; and 

(b) a detection reagent. 

35. 1 h.e kit :iiiy one o[ claim:. 32-34 \sn(j(eiu the poi\'peptidc(^) are 
iunnobili/.cd on a solid support. 

36. The kit of claim 35 wherein the solid support comprises nitrocellulose, 
latex or a plastic material. 

37. The kit of anv one of claims 32-34 wherein the detection reagent 
comprises a reporter group conjugated to a binding agent. 

3iS. The kit o( claim 37 wherein the binding agent is selected from the 
group consistinLi ot^inti- imnumoghifTuhns, Protem (k Protein .A and lectins. 

3^>. 1 he kit (>r claim ,^^7 wherein the reporter group is selected from the 

i'loup ciuisistmi: ol i.iJioisotope.s. nuore>ceiii LMoup:^, luiiiHieseent gr(Hifv., eii/\ t]ie.>. biotm 
Ave particles 

41) \ viiaL'Moslie kil eonipiiMiij' .i! KmsI two oliiUHHieleol ide [iruuers, al 
ie.l^l o!K' ol the oligv^:uiele(^tKle piitTieis lieiiig Npeei:'ie Itw a DNA niitleeule accoidiug Ui 
eiaini ."^ 
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41. A diagnostic kit according to claim 40, wherein at least one oT the 
ohgonucleotide primers comprises at least about JO contiguous nucleotide ol a DNA 
molecule according to claim 5. 

42. A diagnostic kit comprising a at least two oligonucleotide primers, at 
least one of the primers being specific for a DNA sequence selected from the group consisting 
of Sb:Q ID NOS: 3, 11, 12, 135, 136, 151-155, 184-188, 104-195 and 198, 

43. A di.'ifmo^tir kit according to elain^ 42, wherein at Icasi one of the 
oligonucleotide primers comprises at least about 10 contiguous nucleotide of a DNA 
sequence selected from llie group consisting ol SI^Q ID NOS: 3. II, 12, 135, 136, 151-155. 
184-1S8, 194-195 and 198. 

44. A diagnostic kit comprising at least one oligonucleotide probe, the 
oligonucleotide probe being specific for a DNA inolcciile according to claim 5. 

45. A kit according to claim 44, wherein the oligonucleotide probe 
comprises at least about 1 5 contiguous uuclctUidcs of a DNA molecule according to claim 5. 

46. A diagnostic kit comprising at least one oligonucleotide probe, the 
oligonucleotide probe being specific for a DN.A sequence selected trom the group ccnisisting 
ot SI (j IDNOS: .V ! 1. 12. r>(n 1^1 1 ix: ]SS. and 198. 

47. A kit aeeoiuiiiL'. elaifii A(k wherein the ohgtmucleotide [^tolu' 
comprises at least aboiil 1,^ cont]L:ut>u^ uueteotkles ot a DN.A sequence selected from the 
I'toiip ciuisistmi' of SI n ID NDS: v If 1 .A 1 >A 1 f 1 - 1 A\ 1S4AS8. PU-l^.-^ and f)8 
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49. A polyclonal antibody thai binds lo a polypeptide according to any of 



claims 1-4. 



50. A fusion protein comprising two or more polypeptides according to 
any one of claims ! -4. 



any one of claims 1-4 and b;SA'i -6 (SFQ II) NO: 99). 

52. A fusion protein comprising a polypeptide having an \-terminal 
sequence selected from the group of sequences provided in SI*Q II) NOS: 129 and 130. 

53. A fusion protem comprismg one or more polypeptides according to 
imy one of claims 1-4 and the M. tuherculosis antigen 38 kD (SRQ ID NO; 150). 



51. 



A fusion protein comprising one or more polypeptides according to 



54. 



A diagnostic kit comprising: 



(a) one or more fusion proteins acci>iciing to any one (4"claims 50-^3; and 



(h) a deleeticMi rL-agenl. 
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